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Preface 


This book covers calculus in two and three variables. It is suitable for a one-semester course, 
normally known as “Vector Calculus”, “Multivariable Calculus”, or simply “Calculus IIT”. 
The prerequisites are the standard courses in single-variable calculus (a.k.a. Calculus I and 
ID. 

I have tried to be somewhat rigorous about proving results. But while it is important for 
students to see full-blown proofs - since that is how mathematics works - too much rigor and 
emphasis on proofs can impede the flow of learning for the vast majority of the audience at 
this level. If I were to rate the level of rigor in the book on a scale of 1 to 10, with 1 being 
completely informal and 10 being completely rigorous, I would rate it as a 5. 

There are 420 exercises throughout the text, which in my experience are more than 
enough for a semester course in this subject. There are exercises at the end of each sec- 
tion, divided into three categories: A, B and C. The A exercises are mostly of a routine 
computational nature, the B exercises are slightly more involved, and the C exercises usu- 
ally require some effort or insight to solve. A crude way of describing A, B and C would be 
“Basy”, “Moderate” and “Challenging”, respectively. However, many of the B exercises are 
easy and not all the C exercises are difficult. 

There are a few exercises that require the student to write his or her own computer pro- 
gram to solve some numerical approximation problems (e.g. the Monte Carlo method for 
approximating multiple integrals, in Section 3.4). The code samples in the text are in the 
Java programming language, hopefully with enough comments so that the reader can figure 
out what is being done even without knowing Java. Those exercises do not mandate the use 
of Java, so students are free to implement the solutions using the language of their choice. 
While it would have been simple to use a scripting language like Python, and perhaps even 
easier with a functional programming language (such as Haskell or Scheme), Java was cho- 
sen due to its ubiquity, relatively clear syntax, and easy availability for multiple platforms. 

Answers and hints to most odd-numbered and some even-numbered exercises are pro- 
vided in Appendix A. Appendix B contains a proof of the right-hand rule for the cross prod- 
uct, which seems to have virtually disappeared from calculus texts over the last few decades. 
Appendix C contains a brief tutorial on Gnuplot for graphing functions of two variables. 

This book is released under the GNU Free Documentation License (GFDL), which allows 
others to not only copy and distribute the book but also to modify it. For more details, see 
the included copy of the GFDL. So that there is no ambiguity on this matter, anyone can 
make as many copies of this book as desired and distribute it as desired, without needing 
my permission. The PDF version will always be freely available to the public at no cost 
(go to http://www.mecmath.net). Feel free to contact me at mcorral@schoolcraft.edu for 
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iv Preface 


any questions on this or any other matter involving the book (e.g. comments, suggestions, 


corrections, etc). I welcome your input. 
Finally, I would like to thank my students in Math 240 for being the guinea pigs for the 
initial draft of this book, and for finding the numerous errors and typos it contained. 


January 2008 MICHAEL CORRAL 
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1 Vectors in Euclidean Space 


1.1 Introduction 


In single-variable calculus, the functions that one encounters are functions of a variable 
(usually x or t) that varies over some subset of the real number line (which we denote by R). 
For such a function, say, y = f(x), the graph of the function f consists of the points (x, y) = 
(x, f(x)). These points lie in the Euclidean plane, which, in the Cartesian or rectangular 
coordinate system, consists of all ordered pairs of real numbers (a,b). We use the word 
“Buclidean” to denote a system in which all the usual rules of Euclidean geometry hold. We 
denote the Euclidean plane by R; the “2” represents the number of dimensions of the plane. 
The Euclidean plane has two perpendicular coordinate axes: the x-axis and the y-axis. 

In vector (or multivariable) calculus, we will deal with functions of two or three variables 
(usually x, y or x,y,z, respectively). The graph of a function of two variables, say, z2 = f(x,y), 
lies in Euclidean space, which in the Cartesian coordinate system consists of all ordered 
triples of real numbers (a,b,c). Since Euclidean space is 3-dimensional, we denote it by RÎ. 
The graph of f consists of the points (x,y,z) = (x,y, f (x,y)). The 3-dimensional coordinate 
system of Euclidean space can be represented on a flat surface, such as this page or a black- 
board, only by giving the illusion of three dimensions, in the manner shown in Figure 1.1.1. 
Euclidean space has three mutually perpendicular coordinate axes (x,y and z), and three 
mutually perpendicular coordinate planes: the xy-plane, yz-plane and xz-plane (see Figure 
1.1.2). 


yz-plane 


Figure 1.1.1 Figure 1.1.2 
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The coordinate system shown in Figure 1.1.1 is known as a right-handed coordinate 
system, because it is possible, using the right hand, to point the index finger in the positive 
direction of the x-axis, the middle finger in the positive direction of the y-axis, and the thumb 
in the positive direction of the z-axis, as in Figure 1.1.83. 


1 ر 


Figure 1.1.3 Right-handed coordinate system 


An equivalent way of defining a right-handed system is if you can point your thumb up- 
wards in the positive z-axis direction while using the remaining four fingers to rotate the 
x-axis towards the y-axis. Doing the same thing with the left hand is what defines a left- 
handed coordinate system. Notice that switching the x- and y-axes in a right-handed 
system results in a left-handed system, and that rotating either type of system does not 
change its “handedness”. Throughout the book we will use a right-handed system. 

For functions of three variables, the graphs exist in 4-dimensional space (i.e. R®), which 
we can not see in our 3-dimensional space, let alone simulate in 2-dimensional space. So 
we can only think of 4-dimensional space abstractly. For an entertaining discussion of this 
subject, see the book by ABBOTT. 

So far, we have discussed the position of an object in 2-dimensional or 3-dimensional space. 
But what about something such as the velocity of the object, or its acceleration? Or the 
gravitational force acting on the object? These phenomena all seem to involve motion and 
direction in some way. This is where the idea of a vector comes in. 


TOne thing you will learn is why a 4-dimensional creature would be able to reach inside an egg and remove the 
yolk without cracking the shell! 
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You have already dealt with velocity and acceleration in single-variable calculus. For 
example, for motion along a straight line, if y = f(t) gives the displacement of an object after 
time t, then dy/dt = f'(t) is the velocity of the object at time t. The derivative f(t) is just a 
number, which is positive if the object is moving in an agreed-upon “positive” direction, and 
negative if it moves in the opposite of that direction. So you can think of that number, which 
was called the velocity of the object, as having two components: a magnitude, indicated 
by a nonnegative number, preceded by a direction, indicated by a plus or minus symbol 
(representing motion in the positive direction or the negative direction, respectively), i.e. 
f(t) = +a for some number a > 0. Then a is the magnitude of the velocity (normally called 
the speed of the object), and the + represents the direction of the velocity (though the + is 
usually omitted for the positive direction). 

For motion along a straight line, i.e. in a 1-dimensional space, the velocities are also con- 
tained in that 1-dimensional space, since they are just numbers. For general motion along a 
curve in 2- or 3-dimensional space, however, velocity will need to be represented by a multi- 
dimensional object which should have both a magnitude and a direction. A geometric object 
which has those features is an arrow, which in elementary geometry is called a “directed line 
segment”. This is the motivation for how we will define a vector. 


Definition 1.1. A (nonzero) vector is a directed line segment drawn from a point P (called 
its initial point) to a point Q (called its terminal point), with P and Q being distinct 
points. The vector is denoted by PQ. Its magnitude is the length of the line segment, 
denoted by |PQ | and its direction is the same as that of the directed line segment. The 
zero vector is just a point, and it is denoted by 0. 


To indicate the direction of a vector, we draw an arrow from its initial point to its terminal 
point. We will often denote a vector by a single bold-faced letter (e.g. v) and use the terms 
“magnitude” and “length” interchangeably. Note that our definition could apply to systems 
with any number of dimensions (see Figure 1.1.4 (a)-(c)). 
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Figure 1.1.4 Vectors in different dimensions 
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A few things need to be noted about the zero vector. Our motivation for what a vector is 
included the notions of magnitude and direction. What is the magnitude of the zero vector? 
We define it to be zero, i.e. || O|l = 0. This agrees with the definition of the zero vector as just 
a point, which has zero length. What about the direction of the zero vector? A single point 
really has no well-defined direction. Notice that we were careful to only define the direction 
of a nonzero vector, which is well-defined since the initial and terminal points are distinct. 
Not everyone agrees on the direction of the zero vector. Some contend that the zero vector 
has arbitrary direction (i.e. can take any direction), some say that it has indeterminate 
direction (i.e. the direction can not be determined), while others say that it has no direction. 
Our definition of the zero vector, however, does not require it to have a direction, and we will 
leave it at that. 

Now that we know what a vector is, we need a way of determining when two vectors are 
equal. This leads us to the following definition. 


Definition 1.2. Two nonzero vectors are equal if they have the same magnitude and the 
same direction. Any vector with zero magnitude is equal to the zero vector. 


By this definition, vectors with the same magnitude and direction but with different initial 
points would be equal. For example, in Figure 1.1.5 the vectors u, v and w all have the same 
magnitude V5 (by the Pythagorean Theorem). And we see that u and w are parallel, since 
they lie on lines having the same slope 3, and they point in the same direction. So u = w, 
even though they have different initial points. We also see that v is parallel to u but points 
in the opposite direction. So u # Vv. 
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Figure 1.1.5 


So we can see that there are an infinite number of vectors for a given magnitude and 
direction, those vectors all being equal and differing only by their initial and terminal points. 
Is there a single vector which we can choose to represent all those equal vectors? The answer 
is yes, and is suggested by the vector w in Figure 1.1.5. 


In the subject of linear algebra there is a more abstract way of defining a vector where the concept of “direction” 
is not really used. See ANTON and RORRES. 
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Unless otherwise indicated, when speaking of “the vector” with a given magnitude and 


direction, we will mean the one whose initial point is at the origin of the coordinate 
system. 


Thinking of vectors as starting from the origin provides a way of dealing with vectors in 
a standard way, since every coordinate system has an origin. But there will be times when 
it is convenient to consider a different initial point for a vector (for example, when adding 
vectors, which we will do in the next section). 

Another advantage of using the origin as the initial point is that it provides an easy cor- 
respondence between a vector and its terminal point. 


Example 1.1. Let v be the vector in R3 whose initial point is at the origin and whose ter- 
minal point is (3,4,5). Though the point (3,4,5) and the vector v are different objects, it is 
convenient to write v = (3,4,5). When doing this, it is understood that the initial point of v 
is at the origin (0,0,0) and the terminal point is (3,4,5). 


P(3,4,5) 
e 


Xx 


(a) The point (3,4,5) (b) The vector (3,4,5) 


Figure 1.1.6 Correspondence between points and vectors 


Unless otherwise stated, when we refer to vectors as v = (a,b) in R or v = (a,b,c) in RÛ, 
we mean vectors in Cartesian coordinates starting at the origin. Also, we will write the zero 
vector 0 in RÛ and RF as (0,0) and (0,0,0), respectively. 

The point-vector correspondence provides an easy way to check if two vectors are equal, 
without having to determine their magnitude and direction. Similar to seeing if two points 
are the same, you are now seeing if the terminal points of vectors starting at the origin 
are the same. For each vector, find the (unique!) vector it equals whose initial point is 
the origin. Then compare the coordinates of the terminal points of these “new” vectors: if 
those coordinates are the same, then the original vectors are equal. To get the “new” vectors 
starting at the origin, you translate each vector to start at the origin by subtracting the 
coordinates of the original initial point from the original terminal point. The resulting point 
will be the terminal point of the “new” vector whose initial point is the origin. Do this for 
each original vector then compare. 
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6 
RS (2,1,5),Q = (3,5,7,R = 


Example 1.2. Consider the vectors PQ and RS in R3, where P = 


(1,-3,-2) and S = (2,1,0). Does PQ = RS? 
Solution: The vector PQ is equal to the vector v with initial point (0, 0, 0) and terminal point 


Q -P =(3,5,7)-(2,1,5(= (38-2,5-1,7-5( = )1,4,2( 
Similarly, RS is equal to the vector w with initial point (0, 0,0) and terminal point S — R = 


(2,1,0) (1,-3,-2( = )2 11 (-8),0-(-2)) = (1,4,2). 
So PQ = v= (1,4,2) and RS = W = (1,4,2). 


“.PQ=RS 
Q 
PQ )3,5,7( 
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Figure 1.1.7 


Recall the distance formula for points in the Euclidean plane 


For points P = (<1, y1), Q = (xs, y2) in R۶, the distance d between P and Q is 


d= ۷ 2 x1) + (y= y1 


By this formula, we have the following result 


For a vector PQ in R with initial point P = (x,,yı) and terminal point 


Q = (x», y2»), the magnitude of PQ is 
|lPQl = ۷) = (۶ + رر = ول(‎ 
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Finding the magnitude of a vector v = (a,b) in R is a special case of formula (1.2) with 
P = (0,0) and Q = (a,b) : 


For a vector v = (a,b) in R, the magnitude of v is: 


vil = Va2 + b2 


To calculate the magnitude of vectors in R3, we need a distance formula for points in 
Euclidean space (we will postpone the proof until the next section): 


Theorem 1.1. The distance d between points P = (%,,y1,2,) and Q = (xs, y2,z») in Rê is: 
d = ۷)2 - 1(۳ +) - )رو‎ + )2 - 2 (1.4 


The proof will use the following result: 


Theorem 1.2. For a vector v = (a,b,c) in RF, the magnitude of v is: 


vil = Va2 + b2 + c2 (1.5( 


Proof: There are four cases to consider: 


Case 1: a= b = c =0. Then v =0, so |lv|| = 0= V02 + 02 + 02 = Va? + b2 + c2. 


Case 2: exactly two of a,b,c are 0. Without loss of generality, we assume that a = b = 0 and 
c # 0 (the other two possibilities are handled in a similar manner). Then v = (0,0,c), which 
is a vector of length |c| along the z-axis. So |lv|l = |c| = Ve = V0 + 02 + c2 = Va + b2 + c2. 


Case 3: exactly one of a,b,c is 0. Without loss of generality, we assume that a = 0, b #0 
and c # 0 (the other two possibilities are handled in a similar manner). Then v = (0,b,c), 
which is a vector in the yz-plane, so by the Pythagorean Theorem we have || v || = Vb + c2 = 
V0 + b2 + c2 = Va2 + b2 + c2. 


Case 4: none of a,b,c are 0. Without loss of generality, we can as- 
sume that a, b,c are all positive (the other seven possibilities are 
handled in a similar manner). Consider the points P = (0,0,0), 
Q = (a,b,c), R = (a,b,0), and S = (a,0,0), as shown in Figure 
1.1.8. Applying the Pythagorean Theorem to the right trian- 
gle APSR gives |PR| = a + b. A second application of the 
Pythagorean Theorem, this time to the right triangle APQR, 
gives llvll = |PQ| = VIPR| +|QR| = Va + b2 + c. 

This proves the theorem. QED Figure 1.1.8 


Q(a, b,c) 
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Example 1.3. Calculate the following: 


(a) The magnitude of the vector PQ in R with P = (-1,2) and Q = (5,5). 
Solution: By formula (1.2), || PQ || = V(5 - (1)) + (5 - 2)2 = V36 +9 = 45 = 3۷5. 


(b) The magnitude of the vector v = (8,3) in RÎ. 
Solution: By formula (1.3), |lv|l = V82 + 32 = 13. 


(c) The distance between the points P = (2,-1,4) and Q = (4,2,3) in RJ. 
Solution: By formula (1.4), the distance d = /(4- 2)2 + (2 —- (-1))2 + (-3 - 4)2 = 
V/4+9+ 49 = (62. 


(d) The magnitude of the vector v = (5,8,2) in RÎ. 
Solution: By formula (1.5), || vl = V5 + 82 + (-2)2 = (25 + 64+ 4 = (93. 


Exercises 
A 


1. Calculate the magnitudes of the following vectors: 
(a) v = (2,-1) (b)v=(2,-1,0) (c(v=(3,2,-2) (dQ) v= (0,0,1) (e) v = (6,4,-4) 


2. For the points P = (1,-1,1), Q = (2,-2,2), R = (2,0,1), S = (3,-1,2), does PQ = RS? 
3. For the points P = (0,0,0), Q = (1,3,2), R = (1,0,1), S = (2,3,4), does PQ = RS? 

B 

4. Let v = (1,0,0) and w = (a, 0,0) be vectors in R3. Show that ll wl = lal II vl|. 

5. Let v= (a,b,c) and w = (3a,3b,3c) be vectors in R3. Show that ll wil = 3 Il vl|. 

C 


6. Though we will see a simple proof of Theorem 1.1 
in the next section, it is possible to prove it using رظ‎ 2,2 2( 
methods similar to those in the proof of Theorem P(e 91,21) 

1.2. Prove the special case of Theorem 1.1 where the 
points P = (x1,71,2,) and Q = (xs, y»,z») satisfy the fol- 
lowing conditions: 

Xx > xı >0, Y, > Y1 > 0, and Zz, > z1 >0. 

(Hint: Think of Case 4 in the proof of Theorem 1.2, 
and consider Figure 1.1.9.) 


R(X, Y2,21)‏ کم 


y 
ا‎ 


X110) 


(%2, 2,0) 


x U(x2,71,0) 


Figure 1.1.9 
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1.2 Vector Algebra 


Now that we know what vectors are, we can start to perform some of the usual algebraic 
operations on them (e.g. addition, subtraction). Before doing that, we will introduce the 
notion of a scalar. 


Definition 1.3. A scalar is a quantity that can be represented by a single number. 


For our purposes, scalars will always be real numbers. Examples of scalar quantities are 
mass, electric charge, and speed (not velocity). We can now define scalar multiplication of 
a vector. 


Definition 1.4. For a scalar F and a nonzero vector v, the scalar multiple of v by RF, 
denoted by kv, is the vector whose magnitude is |A| ll vl, points in the same direction as v if 
Fk >0, points in the opposite direction as v if f < 0, and is the zero vector 0 if kF = 0. For the 
zero vector 0, we define F0 = 0 for any scalar R. 


Two vectors v and w are parallel (denoted by v || w) if one is a scalar multiple of the other. 
You can think of scalar multiplication of a vector as stretching or shrinking the vector, and 
as flipping the vector in the opposite direction if the scalar is a negative number (see Figure 


1.2.1). 
٤ 1 1 ٤ 


Figure 1.2.1 


Recall that translating a nonzero vector means that the initial point of the vector is 
changed but the magnitude and direction are preserved. We are now ready to define the 
sum of two vectors. 


Definition 1.5. The sum of vectors v and w, denoted by v + w, is obtained by translating 
w so that its initial point is at the terminal point of v; the initial point of v + w is the initial 
point of v, and its terminal point is the new terminal point of w. 


3The term scalar was invented by 1وth‎ century Irish mathematician, physicist and astronomer William Rowan 
Hamilton, to convey the sense of something that could be represented by a point on a scale or graduated ruler. 
The word vector comes from Latin, where it means “carrier”. 

4An alternate definition of scalars and vectors, used in physics, is that under certain types of coordinate trans- 
formations (e.g. rotations), a quantity that is not affected is a scalar, while a quantity that is affected (in a 
certain way) is a vector. See MARION for details. 


10 CHAPTER 1. VECTORS IN EUCLIDEAN SPACE 
Intuitively, adding w to v means tacking on w to the end of v (see Figure 1.2.2). 


Ww 


0 W 
Vv Vv 


(a) Vectors v and w (b) Translate w to the end of v (c) The sum v+ w 


Vv 


Figure 1.2.2 Adding vectors v and w 


Notice that our definition is valid for the zero vector (which is just a point, and hence can 
be translated), and so we see that v + 0 = v = 0+ v for any vector v. In particular, 0 + 0 = 0. 
Also, it is easy to see that v + (-v) = 0, as we would expect. In general, since the scalar 
multiple ~v = —1v is a well-defined vector, we can define vector subtraction as follows: 
v-w=v+(-w). See Figure 1.2.3. 


Vv 
Ww V-Ww 
a -W -W 
Vv 
(a) Vectors v and w (b) Translate -w to the end of v (c) The difference v — w 


Figure 1.2.3 Subtracting vectors v and w 


Figure 1.2.4 shows the use of “geometric proofs” of various laws of vector algebra, that is, 
it uses laws from elementary geometry to prove statements about vectors. For example, (a) 
shows that v + w = w + v for any vectors v, w. And (c) shows how you can think of v—-w as 
the vector that is tacked on to the end of w to add up to Vv. 


(a) Add vectors (b) Subtract vectors (ce) Combined add/subtract 


Figure 1.2.4 “Geometric” vector algebra 


Notice that we have temporarily abandoned the practice of starting vectors at the origin. 
In fact, we have not even mentioned coordinates in this section so far. Since we will deal 
mostly with Cartesian coordinates in this book, the following two theorems are useful for 
performing vector algebra on vectors in R® and RF starting at the origin. 
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Theorem 1.3. Let v = (u,,u»), w = (w,,w,») be vectors in R/, and let k be a scalar. Then 
(a) kv = (ku1, ku») 


(b) v + w = (u, +W1,U2 + We?) 


Proof: (a) Without loss of generality, we assume that vu,,u, > 0 (the other possibilities are 
handled in a similar manner). If F = 0 then kv = Ov = 0 = (0,0) = (0u,, 0u») = (ku,,ku»), which 
is what we needed to show. If F # 0, then (ku,, kv») lies on a line with slope 2 = د‎ which 
is the same as the slope of the line on which v (and hence kv) lies, and (ku,,kua) points in 
the same direction on that line as kv. Also, by formula (1.3) the magnitude of (ku,, ku,») is 


V(Ru1) + (Rus) = (2u? + R2? = (R2(2 + 02) = |k| o2 + u} = E1 lvl. So kv and (kv, ks) 


have the same magnitude and direction. This proves (a). 


(þb) Without loss of generality, we assume that 
U1,U2,wW1ı,w, > 0 (the other possibilities are han- 


ف 


dled in a similar manner). From Figure 1.2.5, we w2 

see that when translating w to start at the end of u 

v, the new terminal point of w is (Uı +wW1,Us+ we»), 5 
so by the definition of v + w this must be the ter- 0 w1 U1 U1+w1 
minal point of v + w. This proves (b). QED 


Figure 1.2.5 
Theorem 1.4. Let v = (U,,U»,Us), W = (w,,wa,w;) be vectors in RF, let F be a scalar. Then 
(a) kv = (ku, ku2, Rus) 

(b) v + w = (u, +W1,U2 + W2, Ug + W3) 


The following theorem summarizes the basic laws of vector algebra. 


Theorem 1.5. For any vectors u, v, w, and scalars k,l, we have 


(a v+w=w+v Commutative Law 
(b) u+ (v + w) = (u+ v)+w Associative Law 
((v+0=v=0+v Additive Identity 
(d) v+(-v)= 0 Additive Inverse 
(e) R(Iv) = (kl)v Associative Law 
(OD k(v+w)= kv + kW Distributive Law 
(g) (k+Dv = kv + lv Distributive Law 


Proof: (a) We already presented a geometric proof of this in Figure 1.2.4(a). 


(b) To illustrate the difference between analytic proofs and geometric proofs in vector alge- 
bra, we will present both types here. For the analytic proof, we will use vectors in RF (the 
proof for R is similar). 
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Let u = (U1,Us,U3), V = (U1,U2,U3), W = (W1,W2, ws) be vectors in R3. Then 


U + (V+ W) = (U1, U2, Ug) + ((U1,U2,Ug) + (W1, W2, Wg)) 
= (U1,Ua,U3) + (U + W1, U2 + W2, Ug + Wg) by Theorem 1.4(b) 
= (u, + (U +w1), Us + (U, + wa), Us + (us + wو))‎ by Theorem 1.4(b) 
= (u +Uu1)+W1,(Ua + U2) + Wa, (Us + u4) + ws) by properties of real numbers 
= (U1 +U1,Ua + U2, Ug + Ug) + (W1, W2, W3) by Theorem 1.4(b) 


= (u+v)+w 


This completes the analytic proof of (b). Figure 1.2.6 provides the geometric proof. 
u+ (v+ Ww) = (u+ V)+ WwW 


VHIW 
u utvV 


Vv 
Figure 1.2.6 Associative Law for vector addition 


(c) We already discussed this on p.10. 
(d) We already discussed this on p.10. 


(e) We will prove this for a vector Vv = (U1,Ua,Ug) in RF (the proof for R® is similar): 


k(lv) = Rk(lu,,lua,lus) by Theorem 1.4(a) 
= (klu,,klua, klug) by Theorem 1.4(a) 
= (Rl)(u1,U2,Ug) by Theorem 1.4(a) 
= (kI)v 
(f and (g): Left as exercises for the reader. QED 


A unit vector is a vector with magnitude 1. Notice that for any nonzero vector v, the 


vector TT is a unit vector which points in the same direction as v, since Tî > 0 and || TT l= 
lvl _ 


vl 1. Dividing a nonzero vector v by |l vl is often called normalizing Vv. 

There are specific unit vectors which we will often use, called the basis vectors: 
i = (1,0,0), j = (0,1,0), and k = (0,0,1) in R3; i= (1,0) and j = (0, 1) in R/. 
These are useful for several reasons: they are mutually perpendicular, since they lie on 
distinct coordinate axes; they are all unit vectors: |lill = lljll = |llkll = 1; every vector can 
be written as a unique scalar combination of the basis vectors: v = (a,b) = ai+ bj in R2, 
v= (a,b,c) = ai+ bj +ck in R3. See Figure 1.2.7. 
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2 
V = (a,b,c) 
J ا‎ 
2 v = (a,b) ek 
y 
3 ااا‎ 
bj 0 
J ر‎ 
0 i1 2 O0 ai زط‎ 
(a) R2 (b) v= ai + bj (e) RÛ (@ v=ai+bj+cek 


Figure 1.2.7 Basis vectors in different dimensions 


When a vector v = (a,b,c) is written as v = ai+ bj + ck, we say that v is in component 
form, and that a, b, and c are the i, jJ, and k components, respectively, of v. We have: 
VvV=uرi+u,j+uşk,k‎ a scalar => kv = ku,i+ ku, j + kuş k 


V=uرi+u,]‎ + ugk, w = w,1+ wa] + w4 lk => v + w = (u, + w1) + (vu, + w2) + (Ug + w4)k 


وا + v7 + v7‏ = || || جد وں + زوں + 1ں = ۷ 


Example 1.4. Let v = (2,1,-1) and w = (3,-4,2) in RÎ. 


(a) Find vw. 
Solution: vw = (2-3,1 (-4),-1-2) = (-1,5,-8( 


(b) Find 3v + 2w. 
Solution: 3v + 2w = (6,3,-3) + (6, -8,4) = (12,-5,1( 


(c) Write v and w in component form. 
Solution: v = 2i+j—k, w = 3i-4j+ 2k 


(d) Find the vector u such that u+ v = w. 
Solution: By Theorem 1.5, u= w-v = -(yv—-w) = —(—-1,5,-83) = (1,5,3), by part(a). 


(e) Find the vector u such that u + v + w =0. 
Solution: By Theorem 1.5, u= -w-v = -—(3,-4,2)—-(2,1,-1)= (-5,3,-1). 


( Find the vector u such that 2u +i— 2j = k. 
Solution: 2u = -i+ 2j +k => u= -}i+j +k 
(g) Find the unit vector TI: 


n 1 ANS E 
Solution: 7 ت‎ (2,1, 1) = 


1 
/22+12+(-17 V6? V6? 2) 
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We can now easily prove Theorem 1.1 from the previous section. The distance d between 
two points P = (<1,Y1,2,) and Q = (%2, y2, 2») in RF is the same as the length of the vector w—v, 
where the vectors v and w are defined as Vv = (xı,y1,2,) and w = (xs, y»,z2») (see Figure 1.2.8). 
So since w-v = (x, %1, = Y1,22- 2), then d = [lw -v|| = V(x — x1) + (ya — y1) + (2, — 2,2 by 


Theorem 1.2. 
P(%1,71,21) 
Q(x2, 72,22) 
Y 
Figure 1.2.8 Proof of Theorem 1.2: d = ||w = v|| 
Exercises 
A 
1. Let v= (-1,5,—-2) and w = (3,1,1). 
(a) Find v—w. (b) Find v+ w. (c) Find TI (dQ) Find | 1(v — Ww) | 


(e) Find | 1(v+ w)|. (9 Find -2v+4w. (g) Find v-2w. 
(h) Find the vector u such that u+ v + w =i. 

(D Find the vector u such that u + v + w = 2j +k. 

(j) Is there a scalar m such that m(v + 2w) = k? If so, find it. 


2. For the vectors v and w from Exercise 1, is lv—wll = ll vil = llw||? If not, which quantity 
is larger? 

3. For the vectors v and w from Exercise 1, is lv + wll = ll vil + llw||? If not, which quantity 
is larger? 

B 

4. Prove Theorem 1.5(f) for R. 5. Prove Theorem 1.5(g) for Rî. 

C 


6. We know that every vector in R can be written as a scalar combination of the vectors i, 
j, and k. Can every vector in RÛ be written as a scalar combination of just i and j, i.e. for 
any vector v in RF, are there scalars m, n such that v = mi + nj? Justify your answer. 
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1.3 Dot Product 


You may have noticed that while we did define multiplication of a vector by a scalar in the 
previous section on vector algebra, we did not define multiplication of a vector by a vector. 
We will now see one type of multiplication of vectors, called the dot product. 


Definition 1.6. Let v = (v,,u,,us) and w = (wı,wa,w;) be vectors in Rî. 
The dot product of v and w, denoted by vw, is given by: 


VW = UW, + UW; + UŞW3 )1.6( 
Similarly, for vectors v = (v,,vu,) and w = (wı,w») in R2, the dot product is: 


VW = UW, + UW: (1.7) 


Notice that the dot product of two vectors is a scalar, not a vector. So the associative law 
that holds for multiplication of numbers and for addition of vectors (see Theorem 1.5(b),(e)), 
does not hold for the dot product of vectors. Why? Because for vectors u, v, w, the dot 
product u‘ v is a scalar, and so (u‘v)‘w is not defined since the left side of that dot product 
(the part in parentheses) is a scalar and not a vector. 

For vectors v= u,i+u,] + ugk and w = wıi+ wa] + w; k in component form, the dot product 
is still vw = uw, + UW; + UW. 

Also notice that we defined the dot product in an analytic way, i.e. by referencing vector 
coordinates. There is a geometric way of defining the dot product, which we will now develop 
as a consequence of the analytic definition. 


Definition 1.7. The angle between two nonzero vectors with the same initial point is the 
smallest angle between them. 


We do not define the angle between the zero vector and any other vector. Any two nonzero 
vectors with the same initial point have two angles between them: 0 and 360° - 0. We will 
always choose the smallest nonnegative angle 0 between them, so that 0° =< Û0 <= 180°. See 


Figure 1.3.1. 
ِ 0 360° -0 
٣ لğو 0و‎ 
1 \ \ 
360° -0 < ` 360-0 % ر‎ 
(a) 0° <0 < 180° (b) 0 = 180° (ce) 0=0° 


Figure 1.3.1 Angle between vectors 


We can now take a more geometric view of the dot product by establishing a relationship 
between the dot product of two vectors and the angle between them. 
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Theorem 1.6. Let v, w be nonzero vectors, and let 0 be the angle between them. Then 


cosÛ = N )1.8( 
vill wl 


Proof: We will prove the theorem for vectors in RF (the proof for R® is similar). Let v = 
(U1,U2,U4) and WwW = (wı,wa,wو).‎ By the Law of Cosines (see Figure 1.3.2), we have 


lv-wl = lvl + lw? — 2 ll vl || w || cos@ (1.9( 


(note that equation (1.9) holds even for the “degenerate” cases Û = 0° and 180°). 


Figure 1.3.2 


Since V— Ww = (U = W1,U2 — Wa2,Ug ¬ w3), expanding lv wl in equation (1.9) gives 


lvl? + lw — 2l vil lw || cos@ = (u, — w,) + (u, — wa) + (ug — wg) 
ج‎ (u2 — 2U, + w2( + (u2 — UW + w2) + (u2 — 2UgWg + w2) 


2 
= (2 + u2 + 02) + (w2 + w+ w?)— 2(1 + UW: + UW) 


= lvl + llwllÃ - 2(v-w) , so 
-2llvll lw || cosO = -2(v°w) , so since v ¥ 0 and w ¥ 0 then 
VW 


cos0 = , since |lv|| > 0 and |lw|| > 0. QED 
vil wl 


Example 1.5. Find the angle 0 between the vectors v = (2,1,—-1) and w = (3,-4,1). 
Solution: Since vw = (2)(3) + (1)(-4)+ (-1)() = 1, |lv|l = V6, and |lw || = V26, then 


VW 1 1 


85.417 =0 ج 0.08 × س = 
Ivillwll 6۷26 2739‏ 


cosÛ0 = 


Two nonzero vectors are perpendicular if the angle between them is 90°. Since cos90° = 
0, we have the following important corollary to Theorem 1.6: 


We will write v L w to indicate that v and w are perpendicular. 
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Since cosO0 > 0 for 0° < 0 < 90° and cosÛ0 < 0 for 90° < 0 < 180°, we also have: 


By Corollary 1.8, the dot product can be thought of as a way of telling if the angle be- 
tween two vectors is acute, obtuse, or a right angle, depending on whether the dot product 
is positive, negative, or zero, respectively. See Figure 1.3.3. 


Ww Ww Ww 
90° >0 > 180 0- 90° 
0° >0 > 90° 
V V Vv 
(a) vw >0 (b) vw <0 (e) vw =0 


Figure 1.3.3 Sign of the dot product & angle between vectors 


Example 1.6. Are the vectors v = (-1,5,—-2) and w = (3, 1, 1) perpendicular? 
Solution: Yes, v L w since vow = (-1)(3) + (5)(1) + (-2)(1) = 0. 


The following theorem summarizes the basic properties of the dot product. 


Theorem 1.9. For any vectors u, v, w, and scalar k, we have 

(a) vw = w‘v Commutative Law 
(b) (kv)w = v-(kw) = R(vWw) Associative Law 
(c( v۰0=0=0۰v 

(d) u-(v + w) = u-v + uw Distributive Law 
(e) (u+ v)w = uw + v‘W Distributive Law 
Oiv-wl = lvl ll wll Cauchy-Schwarz Inequality 


Proof: The proofs of parts (a)-(e) are straightforward applications of the definition of the 
dot product, and are left to the reader as exercises. We will prove part (f). 

(f) If either v = 0 or w = 0, then vw = 0 by part (c), and so the inequality holds trivially. So 
assume that v and w are nonzero vectors. Then by Theorem 1.6, 


vw =cosO@ ll vll lw || , so 
lv-wl| = lcos0|llvll llwIl , so 


Iv-wli = llvl|l llwil| since |cos@0| < 1. QED 


5Also known as the Cauchy-Schwarz-Buniakovski Inequality. 
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Using Theorem 1.9, we see that if uv = 0 and u-w = 0, then u-(kv+lw) = k(u-v)+l(u-w) = 
R(0) + [(0) = 0 for all scalars k,l. Thus, we have the following fact: 


Ifu Lv and u L w, then u L (kv +Iw) for all scalars k,L. 


For vectors v and w, the collection of all scalar combinations Fv + lw is called the span 
of v and w. If nonzero vectors v and w are parallel, then their span is a line; if they are 
not parallel, then their span is a plane. So what we showed above is that a vector which is 
perpendicular to two other vectors is also perpendicular to their span. 

The dot product can be used to derive properties of the magnitudes of vectors, the most 
important of which is the Triangle Inequality, as given in the following theorem: 


Theorem 1.10. For any vectors v, w, we have 
(a) lvl = vv 
(b) llv+ wll = lvl + || wll Triangle Inequality 


(ce) llv- wll > ll vl = ll wll 


Proof: (a) Left as an exercise for the reader. 


(b) By part (a) and Theorem 1.9, we have 


lv + wll? = (v+ w)-(v + w) = vv + v-w + w-*V + W-*W 

= || v2 + 2)۰ w( + || w۶ , So since a = |a| for any real number a, we have 
< [lvl +2|lv-wl| + llwll , so by Theorem 1.9(f) we have 
< lvl +2lvllwil +lwll = (lvl + lw)” and so 


lIv+wll = llvll + lwll after taking square roots of both sides, which proves (b). 


(c) Since v = w+(v—w), then llvll = llw+(v-w)|l = lwll + ll v—w || by the Triangle Inequality, 
so subtracting || w || from both sides gives |lv|l = llwll = ll v= wl. QED 
The Triangle Inequality gets its name from the fact that in any triangle, YEW 


no one side is longer than the sum of the lengths of the other two sides (see 
Figure 1.3.4). Another way of saying this is with the familiar statement “the ® Ww 


shortest distance between two points is a straight line. Figure 134 


Exercises 
A 


1. Let v= (5,1,-2) and w = (4, -4,3). Calculate vw. 
2. Let v= -3i—-2j—k and w = 6i+ 4j + 2k. Calculate vw. 


For Exercises 3-8, find the angle 0 between the vectors v and w. 


19 


_lv:wl Figure 1.3.5 
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. V=(5,1,-2(, w = )4,-4,3( 4. v= (7,2,-10), w = (2,6,4) 

. V = (2,1,4), w = )1,-2,0( 6. v = (4,2,-1), w = (8,4,-2( 

. V=-1+2j+k, w= -31+ 6j+ 3k 8. v =i, w=3i+2j+ 4k 

. Let v = (8,4,3) and w = (-2,1,4). Is v L w? Justify your answer. 

. Let v = (6,0,4) and w = (0,2,-1). Is v L w? Justify your answer. 

. For v, w from Exercise 5, verify the Cauchy-Schwarz Inequality |vw| = ll v || ll w ||. 
. For v, w from Exercise 6, verify the Cauchy-Schwarz Inequality |lv-w| = ll v || ll w ||. 


. For v, w from Exercise 5, verify the Triangle Inequality llv + wll = ll vil + ll wll. 


For v, w from Exercise 6, verify the Triangle Inequality |lv + wll = ll vil + ll wll. 


14. 


Note: Consider only vectors in RF for Exercises 15-25. 


Prove Theorem 1.9(a). 16. Prove Theorem 1.9(b). 
Prove Theorem 1.9(c). 18. Prove Theorem 1.9(d). 
Prove Theorem 1.9(e). 20. Prove Theorem 1.10(a). 


. Prove or give a counterexample: If uv = uw, then v= w. 


. Prove or give a counterexample: If vw = 0 for all v, then w = 0. 
. Prove or give a counterexample: If uv = u-w for all u, then v = w. 


„ Prove that ivi = lw | = |llv~w|l| for all v, w. 


times written as projwv) is the vector u along the same line L 
as w whose terminal point is obtained by dropping a perpendic- 


ular line from the terminal point of v to L (see Figure 1.3.5). پد‎ Ww 


Show that 


lull = 
ll wll 


(Hint: Consider the angle between v and w.) 


. For nonzero vectors v and w, the projection of v onto w (some- 


15. 
17. 
19. 


26. Let a, Û, and y be the angles between a nonzero vector v in RF and the vectors i, j, and 


k, respectively. Show that cos a + cos 8 + cosy = 1. 


(Note: a, Û, y are often called the direction angles of v, and cosa, cos Û, cosy are called 


the direction cosines.) 
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1.4 Cross Product 


In Section 1.3 we defined the dot product, which gave a way of multiplying two vectors. The 
resulting product, however, was a scalar, not a vector. In this section we will define a product 
of two vectors that does result in another vector. This product, called the cross product, is 
only defined for vectors in R3. The definition may appear strange and lacking motivation, 
but we will see the geometric basis for it shortly. 


Definition 1.8. Let v = (u,,U,,us) and w = (wı,wa,wو)‎ be vectors in R3. The cross product 
of v and w, denoted by v x w, is the vector in RÎ given by: 


VX W = (U2Wg = UW 2, U3W1 = U1 W3, U1 W2 — U2 W1) )1.10( 


Example 1.7. Find i xj. 
Solution: Since i = (1,0,0) and j = (0, 1,0), then 
1 × j = ))0()0( - )0()1(, )0()0( - )1()0(, )1()1( - )0()0(( 


= )0,0,1( 
=k 


Figure 1.4.1 


Similarly it can be shown that j x k =i and k xi=j. 


In the above example, the cross product of the given vectors was perpendicular to both 
those vectors. It turns out that this will always be the case. 


Theorem 1.11. If the cross product v x w of two nonzero vectors v and w is also a nonzero 
vector, then it is perpendicular to both v and w. 


Proof: We will show that (v x w)‘v = 0: 


(V x W)*V = (UWg — UW, UW = UW 3, U W2 =~ UW 1)* (U1, 02,3) 
= U W3U1 e U3Wa2U1 + U3W1Ua i U1W3Ua + U1W2Usg 3 U2 W1Ug 
= U1UaWsg 2 U1U Ws + W1U2Ug ر‎ W1U2Ug + U1Wa2Usg ۰ U1WaUs 


= 0 , after rearranging the terms. 


..V XW JL v by Corollary 1.17. 
The proof that v x w L w is similar. QED 


As a consequence of the above theorem and Theorem 1.9, we have the following: 
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The span of any two nonzero, nonparallel vectors v, w in RF is a plane P, so the above 
corollary shows that v x w is perpendicular to that plane. As shown in Figure 1.4.2, there 
are two possible directions for v x w, one the opposite of the other. It turns out (see Appendix 
B) that the direction of v x w is given by the right-hand rule, that is, the vectors v, w, v x w 
form a right-handed system. Recall from Section 1.1 that this means that you can point your 
thumb upwards in the direction of v x w while rotating v towards w with the remaining four 
fingers. 


VX<W 


پک 


xX —Vx<xW 


Figure 1.4.2 Direction of v x w 


We will now derive a formula for the magnitude of v x w, for nonzero vectors Vv, W: 


2 2 2 2 
lv x wll? = (ugwg — usw») + (usw ~ U13) + (UW ¬ UW 1) 


= vw? — 2UWaUgWg + vw + vw — 2UW1UgWg + vw + vw? — UW UW + vw 


v2(w + w2) + v2 (w2 + w2) + u2(w + w2) — 2(U1W1U2Wa + UW 1UgWg + UW2UWg) 


and now adding and subtracting uw, uw}, and vw on the right side gives 


= س )2ں‎ + w+ w7 ( + )ا‎ + w2 + w2) + 2) w 2 + w7 + (2س‎ 


- (u2 w2 + vw? + vw + 2(U1W1UsW; + U1W1UWg + UW U3W3)) 


(7سw+ u2 + 2)) + w2‏ + 2( ڪ 


(uw) + (uwa) 4 (usw) + 2(u1W1)(Usw2) + 2(U1w1)(UsWa) + 2(usw2)(UsW3))‏ ج 
so using (a + b + ce) = a + b% + c% + 2ab + 2ac + 2bc for the subtracted term gives‏ 
w2)- (Uw + UW» + UW)‏ + رَس + (u + u2 + v2)‏ = 


= | v۶ | w۶ - (v۰ Ww (۶ 


w2 
=| ا - 1 ا‎ , since |lv|l > 0 and |lw || > 0, so by Theorem 1.6 
vil lw 
= [lvl Il wll (1 — cos 0) , where 0 is the angle between v and w, so 


Iv x wil = IlvIlZ Il wl sin@ , and since 0° < 0 < 180°, then sin@ > 0, so we have: 
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If 0 is the angle between nonzero vectors v and w in R®, then 


lv x wil = lvl II wl sin0@ 


It may seem strange to bother with the above formula, when the magnitude of the cross 
product can be calculated directly, like for any other vector. The formula is more useful for 
its applications in geometry, as in the following example. 


Example 1.8. Let APQR and PQRS be a triangle and parallelogram, respectively, as shown 
in Figure 1.4.3. 


Figure 1.4.3 


Think of the triangle as existing in RF, and identify the sides QR and QP with vectors v 
and w, respectively, in R3. Let 0 be the angle between v and w. The area A pPqR Oof APQR is 
Joh, where Öb is the base of the triangle and A is the height. So we see that 


1 . 
APqR = lvl lw sin 
1 
= |v x wl 


2 


So since the area Apqrs of the parallelogram PQRS is twice the area of the triangle APQR, 
then 
ApçgaS = llvll llwll sin@ 


By the discussion in Example 1.8, we have proved the following theorem: 


Theorem 1.13. Area of triangles and parallelograms 


(a) The area A of a triangle with adjacent sides v, w (as vectors in RP) is: 


1 
س س ار 
اا 


(b) The area A of a parallelogram with adjacent sides v, w (as vectors in RF) is: 


A = |lv x wll 
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It may seem at first glance that since the formulas derived in Example 1.8 were for the 
adjacent sides QP and QR only, then the more general statements in Theorem 1.13 that the 
formulas hold for any adjacent sides are not justified. We would get a different formula for 
the area if we had picked PQ and PR as the adjacent sides, but it can be shown (see Exercise 
26) that the different formulas would yield the same value, so the choice of adjacent sides 
indeed does not matter, and Theorem 1.13 is valid. 

Theorem 1.13 makes it simpler to calculate the area of a triangle in 3-dimensional space 
than by using traditional geometric methods. 


Example 1.9. Calculate the area of the triangle APQR, where P = (2,4,-7), Q = (3,7,18), 
and R =(-5,12,8). 


Solution: Let v = PQ and w = PR, as in Figure 1.4.4. Then 
v= (3,7,18)-(2,4,-7( = (1,8,25) and w = (-5, 12, 8(-)2, 4, -7( = 
(7,8,15), so the area A of the triangle APQR is 


2 
Q(3,7,18) 


1 1 
A= 5 lvxwil = 3 (1,3, 25) x (7,8, 15(|| 


1 
و‎ || ((8)(15) - (25)(8), (25)(- 7) - (1)(15), (1(8) - (3(7) | 3  P@24-7) 


1 
ا‎ | «155, -190, 29) || Figure 1.4.4 
1 


1 
60966 = 292 + 190(2-) + 155(2-(7 و 


A x 6 


Example 1.10. Calculate the area of the parallelogram PQRS, where P = (1,1), Q = (2,3), 
R = (5,4), and S = (4,2). 


Solution: Let v = SP and w = SR, as in Figure 1.4.5. Then 
v = (1,1) - (4,2) = (-8,-1) and w = (5,4) ¬ (4,2) = (1,2). But 
these are vectors in R, and the cross product is only defined 
for vectors in R3. However, R can be thought of as the subset 


of RF such that the z-coordinate is always 0. So we can write 
v=(-8,-1,0) and w = (1,2,0). Then the area A of PQRS is 


A = lv x wll = || (-3,-1,0) x (1,2,0) | 
= || ((-1)(0) - (0)(2), (0)(1) — (-8)(0), (-8)(2) - (= 1)10) || 
= | 0,0,-5(| 

A=5 


0 1 2 3 4 5 
Figure 1.4.5 
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The following theorem summarizes the basic properties of the cross product. 


Theorem 1.14. For any vectors u, v, w in RP, and scalar F#, we have 


(av xw = -wxvV Anticommutative Law 
(b)ux(v+w)=uxv+ux Ww Distributive Law 
(c((u+v)xw=uxw+v xw Distributive Law 
(d) (kv) x w = v x (kw) = k(v x w) Associative Law 
(ev x0=0=0xv 

(Ovxv =0 


(g) v xw =0 if and only if v || w 


Proof: The proofs of properties (b)-(f are straightforward. We will prove parts (a) and (g) 
and leave the rest to the reader as exercises. 


(a) By the definition of the cross product and scalar multipli- 
cation, we have: 


VX W = (UzWg ~ UgW2,U3W1 —~ U1 W3, U1 W2 — Ua W1) 
= —(UgWa — UW, U1Wg —~ UW 1, UW 1~ UW») 


= —(WaUg — WU, W3U1 —~ W1 U3, W1 U2 — W2 U1) 


= -WXV 


Xx WXV 


Note that this says that v x w and w x v have the same mag- Figure 1.4.6 


nitude but opposite direction (see Figure 1.4.6). 


(g) If either v or w is 0 then v x w = 0 by part (e), and either v = 0 = Ow or w = 0 = Ov, so Vv 
and w are scalar multiples, i.e. they are parallel. 

If both v and w are nonzero, and 0 is the angle between them, then by formula (1.11), 
vxw = O0if and only if llvll ll w Il sin@ = 0, which is true if and only if sin@ = 0 (since |lv|l > 0 
and |lw|| > 0). So since 0° < 0 < 180°, then sin(0 = 0 if and only if 0 = 0° or 180°. But the 
angle between v and w is 0° or 180° if and only if v || w. QED 


Example 1.11. Adding to Example 1.7, we have 
ixj=k jxk=i kxi=j 
jxi=-—k kxj=-i ix k= زٌَ-‎ 
ixi=jxj=kxk=0 
Recall from geometry that a parallelepiped is a 3-dimensional solid with 6 faces, all of 


which are parallelograms. 


6An equivalent definition of a parallelepiped is: the collection of all scalar combinations #1v1 + kava + Fgvg of 
some vectors V1, V2, Vg in R3, where 0 < k1,k2,kg <1. 
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Example 1.12. Volume of a parallelepiped: Let the vectors u, v, w in R3 represent adjacent 
sides of a parallelepiped P, with u, v, w forming a right-handed system, as in Figure 1.4.17. 
Show that the volume of P is the scalar triple product u» (v x w). 


Solution: Recall that the volume vol(P) of a par- ¥*W 


allelepiped P is the area A of the base parallel- 
ogram times the height Ah. By Theorem 1.13(b), 
the area A of the base parallelogram is jl v x wl. 
And we can see that since v x w is perpendicular 
to the base parallelogram determined by v and 
w, then the height A is |lul|| cos0, where 0 is the 
angle between u and v x w. By Theorem 1.6 we 


know that 
u-(v xw) 
cos0 =. Hence, Figure 1.4.7 Parallelepiped P 
lull lv x wll 
vol(P)= Ah 

ll ull u-(v x w) 

= || y x w|| u 

lull llv x wll 
= u (v Xx Ww) 


In Example 1.12 the height A of the parallelepiped is [lull cos@0, and not —|l ull cos@0, be- 
cause the vector u is on the same side of the base parallelogram’s plane as the vector v x w 
(so that cosÛ > 0). Since the volume is the same no matter which base and height we use, 
then repeating the same steps using the base determined by u and v (since w is on the same 
side of that base’s plane as u x v), the volume is w‘(u x v). Repeating this with the base 
determined by w and u, we have the following result: 


For any vectors u, v, w in RÎ, 


u‘(v Xx wW)= w‘(u xX Vv) = v‘*(w X u) 
(Note that the equalities hold trivially if any of the vectors are 0.) 
Since v xw = —w x v for any vectors v, w in RF, then picking the wrong order for the three 
adjacent sides in the scalar triple product in formula (1.12) will give you the negative of the 


volume of the parallelepiped. So taking the absolute value of the scalar triple product for 
any order of the three adjacent sides will always give the volume: 


Theorem 1.15. If vectors u, v, w in R represent any three adjacent sides of a paral- 
lelepiped, then the volume of the parallelepiped is |u-(v x w)|. 


Another type of triple product is the vector triple product u x (v x w). The proof of the 
following theorem is left as an exercise for the reader: 
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Theorem 1.16. For any vectors u, v, w in RP, 
u x (v x w) = (u‘ w)v — (u vV)w (1.15) 


An examination of the formula in Theorem 1.16 gives some idea of the geometry of the 
vector triple product. By the right side of formula (1.13), we see that u x (v x w) is a scalar 
combination of v and w, and hence lies in the plane containing v and w (i.e. u x (v x w), Vv 
and w are coplanar). This makes sense since, by Theorem 1.11, ux (v x w) is perpendicular 
to both u and v x w. In particular, being perpendicular to v x w means that u x (v x w) lies 
in the plane containing v and w, since that plane is itself perpendicular to v x w. But then 
how is u x (v x w) also perpendicular to u, which could be any vector? The following example 
may help to see how this works. 


Example 1.13. Find u x (v x w) for u = (1,2,4), v = (2,2,0), w = (1,3,0). 


Solution: Since u‘v = 6 and uw = 7, then 


u x (v x w) = (u-w)v — (u v)w 
=7 )2,2,0(- 6)1,3,0( = )14, 14, 0(- )6, 18,0( 
= )8,-4,0( 


Note that v and w lie in the xy-plane, and that u x (v x w) also lies in that plane. Also, 
u x (v x w) is perpendicular to both u and v x w = (0,0,4) (see Figure 1.4.8). 


xX 


Figure 1.4.8 


For vectors Vv =u,i+u,j+usk and w = wıi+wa]j+wو‎ k in component form, the cross product 
is written as: VX W = (UW —UsWa)i+ (ugwı—~Uıws)]+ (uwa —u,w,)k. It is often easier to use the 
component form for the cross product, because it can be represented as a determinant. We 
will not go too deeply into the theory of determinants"; we will just cover what is essential 
for our purposes. 


TSee ANTON and RORRES for a fuller development. 
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A 2x2 matrix is an array of two rows and two columns of scalars, written as 


ا 
ا or‏ 


where a,b,c,d are scalars. The determinant of such a matrix, written as 


a b 
c dû 


a b a b 
0 or det , 
is the scalar defined by the following formula: 
a b 
ا‎ = ad —- bc 


It may help to remember this formula as being the product of the scalars on the downward 
diagonal minus the product of the scalars on the upward diagonal. 


Example 1.14. 


1 - (14(- 28) = 4-6 = -2 


3 4 
A 3 x3 matrix is an array of three rows and three columns of scalars, written as 

dı da ds dı @dد‎ @dşg 

bı bı b3 Or Dı b» bs 

C1 C2 C83 C1 C2 C3 


and its determinant is given by the formula: 


dı da @dşg 

b, bı b3 = da1 b» 0 — d2 0 bs + daş a bı (1.14) 
2 €8 1 €8 C1 Co» 

C1 C2 Cg 


One way to remember the above formula is the following: multiply each scalar in the first 
row by the determinant of the 2 x 2 matrix that remains after removing the row and column 
that contain that scalar, then sum those products up, putting alternating plus and minus 
signs in front of each (starting with a plus). 


Example 1.15. 


= 1)-2-0(-0)8-3(+2)0+1( =0 


4 
1 0 
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We defined the determinant as a scalar, derived from algebraic operations on scalar en- 
tries in a matrix. However, if we put three vectors in the first row of a 3 x 3 matrix, then 
the definition still makes sense, since we would be performing scalar multiplication on those 
three vectors (they would be multiplied by the 2x 2 scalar determinants as before). This gives 
us a determinant that is now a vector, and lets us write the cross product of v = u,i+u,j+ugk 
and w= wıi+wa]j +wşk as a determinant: 

i j k 
U1 U, Ug 
wı WwW Ws 


VXW= 


= (UW ¬ UgWa2)İ + (UgW1 — U1 W3) + (Uw — Uw 1)k 


Example 1.16. Let v = 4i—-j+3k and w =i+ 2k. Then 


i j k 
V<W=| 4 —-1 @3 |= i j + E k= -2i-5j+k 
1 0 2 0 2 1 2 1 0 


The scalar triple product can also be written as a determinant. In fact, by Example 1.12, 
the following theorem provides an alternate definition of the determinant of a 3 x 3 matrix 
as the volume of a parallelepiped whose adjacent sides are the rows of the matrix and form 
a right-handed system (a left-handed system would give the negative volume). 


Theorem 1.17. For any vectors U = (U1,Ua,U3), V = (U1,U2,Ug), W = (W1,Wa2, Ws) in R3: 


U1 U2 Usa 
U1 U2 Us 
Wı W2 Ws 


u-(v x w) = (1.15) 


Example 1.17. Find the volume of the parallelepiped with adjacent sides u = (2,1,3), v = 
(-1,3,2), w = (1,1,-2) (see Figure 1.4.9). 


Solution: By Theorem 1.15, the volume vol(P) of the parallelepiped 
P is the absolute value of the scalar triple product of the three 
adjacent sides (in any order). By Theorem 1.17, 


1 8 
uc(vxw)=| -1 3 2 4 
IR #9 
3 2 -1 2 e 
-2 ت‎ 
َ | 1 A 1 ُ Figure 1.4.9 P 


= 2)-8(¬- 1)0( + 3)-4( = - 28, s0 
vol(P) =|—-28| = 28. 
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Interchanging the dot and cross products can be useful in proving vector identities: 


UW U’°z : 
for all vectors u, v, w, z in RÎ. 
VW Z 


Example 1.18. Prove: (u x v)۰*(w x z) = | 


Solution: Let x = u x v. Then 


(u xX V)‘(W X Z) = x°*(W X Z) 


= w*(zXx x) (by formula (1.12)) 


= W*(Z X (u Xx V)) 


= w°((z*v)u — (z*u)v) (by Theorem 1.16) 


= (Z°*V)(w ° u) — (Z*u)(w* V) 


Zz)(v-w) (by commutativity of the dot product). 


Exercises 


2. v= (7,2,-10(, w = )2,6,4( 
4. v= (1,3,2), w = (7,2,-10( 
6. v =i, w=3i+2j+ 4k 


= (u- w)(v- Zz) — (u 


U‘Ww U°*Z 
VW V°Z 


A 


For Exercises 1-6, calculate v x w. 
1. v= (5,1,-2), w = (4,-4,8( 

3. v= (2,1,4), w = (1,-2,0( 

5. v= -i+2]j+k, w = -31+6j+3k 


For Exercises 7-8, calculate the area of the triangle APQER. 


1. P=(5,1,-2), Q = (4,-4,3), R = (2,4,0) 8. P = (4,0,2), Q = (2,1,5), R = (-1,0,-1) 


For Exercises 9-10, calculate the area of the parallelogram PQRS. 


9. P = (2,1,3), Q = (1,4,5), R = (2,5,3), S = (3,2,1) 


10. P =(-2,-2), Q = (1,4), R = (6,6), S = (3,0) 


For Exercises 11-12, find the volume of the parallelepiped with adjacent sides u, v, w. 


11. u= (1,1,3), V = (2,1,4), W = (5,1,-2( 12. u= (1,3,2), V = (7,2,-10(, w = (1,0,1( 


For Exercises 13-14, calculate u-‘(v x w) and u x (v x w). 


14. u= (1,0,2), v = (-1,0,3(, w = (2,0,-2( 


13. u= (1,1,1), v = (3,0,2), w = (2,2,2) 


15. Calculate (u x v)«(w x z) for u = (1,1, 1), v = (3,0,2), w = (2,2,2), z= (2,1,4). 
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16. If v and w are unit vectors in RF, under what condition(s) would v x w also be a unit 
vector in R3 ? Justify your answer. 


17. Show that if v x w = 0 for all w in RF, then v = 0. 


18. Prove Theorem 1.14(b). 19. Prove Theorem 1.14(c). 
20. Prove Theorem 1.14(d). 21. Prove Theorem 1.14(e). 
22. Prove Theorem 1.14(). 23. Prove Theorem 1.16. 


24. Prove Theorem 1.17. (Hint: Expand both sides of the equation.) 


25. Prove the following for all vectors v, w in R3: 
(a) lv x wl +l v-w|% = lvl ll wll 


(b) Ifv-w =0 and v x w =0, then v = 0 or w =0. 
C 


26. Prove that in Example 1.8 the formula for the area of the triangle APQR yields the 
same value no matter which two adjacent sides are chosen. To do this, show that || u x 
(-w()|| = 3 lv x wil, where u = PR, -w = PQ, and v = QR, w = QP as before. Similarly, 
show that 1 || (u) x (-v)|| = 1 Iv x wll, where -u = RP and -v = RQ. 


27. Consider the vector equation a x x = b in RF, where a ¥ 0. Show that: 


(a) a‘b=0 


bxa 
(b) x= > +Ra is a solution to the equation, for any scalar 


all 


28. Prove the Jacobi identity: u x (v x w)+ v x (w x u) + w x (u x v) =0 
29. Show that u, v, w lie in the same plane in RF if and only if u-(v x w) = 0. 


30. For all vectors u, v, w, zin RF, show that 
(u x v) x (w x z) = (z* (u xX V))w — (w * (U X V))z 
and that 
(u x v) x (w x Zz) = (u-(w x Z))v — (v-(w x Z))u 


Why do both equations make sense geometrically? 
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1.5 Lines and Planes 


Now that we know how to perform some operations on vectors, we can start to deal with some 
familiar geometric objects, like lines and planes, in the language of vectors. The reason 
for doing this is simple: using vectors makes it easier to study objects in 3-dimensional 
Euclidean space. We will first consider lines. 


Line through a point, parallel to a vector 


Let P = (<o, y0,20) be a point in RF, let v = (a,b,c) be a nonzero vector, and let L be the line 
through P which is parallel to v (see Figure 1.5.1). 


Figure 1.5.1 


Let r = (<o, yo,20) be the vector pointing from the origin to P. Since multiplying the vector 
v by a scalar t lengthens or shrinks v while preserving its direction if t > 0, and reversing 
its direction if t < 0, then we see from Figure 1.5.1 that every point on the line L can be 
obtained by adding the vector tv to the vector r for some scalar t. That is, as t varies over all 
real numbers, the vector r + tv will point to every point on L. We can summarize the vector 
representation of L as follows: 


For a point P = (<o, Yo0,20) and nonzero vector v in RF, the line L through P parallel to v 
is given by 
r+tv, for -oo0<t<oO )1.16( 


where r = (xo, Yo,2o) is the vector pointing to P. 


Note that we used the correspondence between a vector and its terminal point. Since 
v = (a,b,c), then the terminal point of the vector r+ tv is (x, + at, yo + bt, zo + ct). We then get 
the parametric representation of L with the parameter t: 


For a point P = (<o, 70,20) and nonzero vector v = (a,b,c) in RF, the line L through P 
parallel to v consists of all points (x,y,z) given by 


x=x+at, Jy=yo+bt, z=z20+ct, for -o0<t<oO (1.17) 


Note that in both representations we get the point P on L by letting t = 0. 
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In formula (1.17), if a # 0, then we can solve for the parameter t: t = (x—x,)/a. We can also 
solve for t in terms of y and in terms of z if neither b nor c, respectively, is zero: t = (y— y,)/b 
and t = (z—zo)/c. These three values all equal the same value t, so we can write the following 
system of equalities, called the symmetric representation of L: 


For a point P = (xo, 70,20) and vector v = (a,b,c) in R3 with a, b and c all nonzero, the line 
L through P parallel to v consists of all points (x, y,z) given by the equations 


X-% _ Y= %0 _ 2 >۶0 
ت س‎ 1.18 
a b cC ( ) 


What if, say, a = 0 in the above scenario? We can not divide by 
zero, but we do know that x = xı + at, and so x = x, + Ot = xo. Then the 
symmetric representation of L would be: 


)1.19( @ د و 


۰ 


>, 
1 


b Cc = 

Note that this says that the line L lies in the plane x = x, which is ا‎ 
parallel to the yz-plane (see Figure 1.5.2). Similar equations can be 3 
derived for the cases when b = O0orc =0. Figure 1.5.2 

You may have noticed that the vector representation of L in formula (1.16) is more compact 
than the parametric and symmetric formulas. That is an advantage of using vector notation. 
Technically, though, the vector representation gives us the vectors whose terminal points 
make up the line L, not just L itself. So you have to remember to identify the vectors r + tv 
with their terminal points. On the other hand, the parametric representation always gives 
just the points on L and nothing else. 


Example 1.19. Write the line L through the point P = (2,3,5) and parallel to the vector 
v = (4,—-1,6), in the following forms: (a) vector, (b) parametric, (c) symmetric. Lastly: (dQ) find 
two points on L distinct from P. 


Solution: (a) Let r = (2,3,5). Then by formula (1.16), L is given by: 
r+tv = (2,3,5)+t(4,-1,6), for -oo<t<oo 
(b) L consists of the points (x, y,z) such that 
x=2+4t, y=3-t, 2=5+6t, for -o0<t<oo 
(c) L consists of the points (x,y,z) such that 


XxX-2 y-3 _ z2-5 


4 -1 6 
(d) Letting t = 1 and t = 2 in part(b) yields the points (6,2, 11) and (10,1, 17) on L. 
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Line through two points 


Let Pı = (x1,Y1,2,) and P; = (xs, y2,z2) be distinct points 
in RF, and let L be the line through P, and P,. Let r, = 
(1,1,21) and r» = (Xa, yY2,2,) be the vectors pointing to P, 
and P», respectively. Then as we can see from Figure 
1.5.3, Fr, ~r, is the vector from P, to P,. So if we multiply 
the vector r, ~r, by a scalar t and add it to the vector 
rı, we will get the entire line L as t varies over all real 
numbers. The following is a summary of the vector, para- 
metric, and symmetric forms for the line L: Figure 1.5.3 


4 
> 


Let Pı = (%1, ¥Y1,21), Ps, = (%9, ¥:,2») be distinct points in RF, and let r, = Vu. — 
(x2, J2,22). Then the line L through P, and P, has the following representations: 


Vector: 
rı +t(r—r,), for -oo0<t<oO )1.20( 


Parametric: 
X=x +(x x1), Jy=y+(ys— y1), 2=21+(2,—21)t, fOr -o00< < oO (1.21) 


Symmetric: 


E —Y 2 ا‎ 
CES EET (1.22) 
XX J2 -¥Y1 22-21 


Example 1.20. Write the line L through the points P, = (-83,1,-4) and P, = (4,4,6) in 
parametric form. 


Solution: By formula (1.21), L consists of the points (x, y,z) such that 
x=-3+7t, y=1+3t, z2=-4-2t, for -o0<t<oO 
Distance between a point and a line 


Let L be a line in RÛ in vector form as r+ tv (for —oo < t < oo), P 
and let P be a point not on L. The distance d from P to L is the w/ ıd 
length of the line segment from P to L which is perpendicular to L 4A ! 
(see Figure 1.5.4). Pick a point Q on L, and let w be the vector from Q4 
Q to P. If 0 is the angle between w and v, then d = |lwl||l sin0. So 


Fi 1.5.4 
since |lv x wll = ll vl| ll w|| sin@ and v ¥ 0, then: E 


)1.23( اف 
vl‏ 
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Example 1.21. Find the distance d from the point P = (1, 1, 1) to the line L in Example 1.20. 
Solution: From Example 1.20, we see that we can represent L in vector form as: r+tv, for 
r= (-83,1,-4) and v = (7,3,-2). Since the point Q = (-3,1,—-4) is on L, then for w = QP = 
(1,1, 1)-(-8,1,-4) = (4,0,5), we have: 


i j k 8 
E E E j + k= 15i-43j-12k , so 
0 5 4 5 4 0 
40 5 
8 lvxwil __ | 15i-43j-12k| _ ۷152 + (-43(2 + (-12( _ 8 8 


ivi |7,3,-2| 72 + 32 + (2 V63 


It is clear that two lines L, and L,, represented in vector form as Fı +sv, and Fr, + tv», 
respectively, are parallel (denoted as L, || L2) if vı and v, are parallel. Also, L, and L» are 
perpendicular (denoted as L, L L»,) if vı and v, are perpendicular. 


In 2-dimensional space, two lines are either identical, parallel, or they ١ ۶ ١ 
intersect. In 3-dimensional space, there is an additional possibility: two Te e 
lines can be skew, that is, they do not intersect but they are not parallel. 1 
However, even though they are not parallel, skew lines are on parallel ' اا‎ 
planes (see Figure 1.5.5). 

To determine whether two lines in RF intersect, it is often easier to use 
the parametric representation of the lines. In this case, you should use dif- 
ferent parameter variables (usually s and t) for the lines, since the values of the parameters 
may not be the same at the point of intersection. Setting the two (x, y,z) triples equal will 
result in a system of 3 equations in 2 unknowns (s and t). 


0 
x 


Figure 1.5.5 


Example 1.22. Find the point of intersection (if any) of the following lines: 


+1 -2 -1 -8 +3 
ا ا‎ and E 


3 2 -1 -3 2 


Solution: First we write the lines in parametric form, with parameters s and t: 
x=-1+3s, yJ=2+2s, z2=1-s and x=-3+t, J=8-3t, z2=-3+231 
The lines intersect when (~1+ 3s,2+2s,1—-s)= (-3+t,8-— 3t,—3 + 2t) for some s8, t: 


-1+83s=-@3+t: <= +6 
2+2s=8-3t: <2+2s=8-3)2+ 3s) =2-9sg > 2g = -9g =ء<>‎ 0> = 2+ 3)0( = 2 
1-s=-3+2t: 1-0=-8+2(2)( > 1=1 ⁄ (Note that we had to check this.) 


Letting s = 0 in the equations for the first line, or letting t = 2 in the equations for the second 
line, gives the point of intersection (~1, 2, 1). 
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We will now consider planes in 3-dimensional Euclidean space. 
Plane through a point, perpendicular to a vector 


Let P be a plane in RF, and suppose it contains a point Po = (Xo, Yo,20). Let n = (a,b,c) be 
a nonzero vector which is perpendicular to the plane P. Such a vector is called a normal 
vector (or just a normal) to the plane. Now let (x,y,z) be any point in the plane P. Then 
the vector r = (%—x%o,YJ = J0,2 — 2o) lies in the plane P (see Figure 1.5.6). So ifr #0, thenr Ln 
and hence n‘r =0. And ifr = 0 then we still have nr =0. 


(%9,2) (%0, ¥020) 


Figure 1.5.6 The plane P 


Conversely, if (x,y,z) is any point in RF such that r = (x — Xo, Y= Jo0,2 ¬ 2o) Z 0 and n‘r =0, 
then r Ln and so (x,y,z) lies in P. This proves the following theorem: 


Theorem 1.18. Let P be a plane in RF, let (%0, ¥0,20) be a point in P, and let n = (a,b,c) be a 
nonzero vector which is perpendicular to P. Then P consists of the points (x, y,z) satisfying 
the vector equation: 

n-r=0 (1.24) 


where r = (x — Xo, Y= y,0,2 ¬ 2o), or equivalently: 
a(x ¬- xo) + b(y — yo) + c(2 ¬ 2o) = 0 (1.25) 


The above equation is called the point-normal form of the plane P. 


Example 1.23. Find the equation of the plane P containing the point (3,1,3) and perpen- 
dicular to the vector n = (2,4,8). 


Solution: By formula (1.25), the plane P consists of all points (x, y,z) such that: 
2(x + 3)+ 4(y - 1) + 8(2 -83( = 0 


If we multiply out the terms in formula (1.25) and combine the constant terms, we get an 
equation of the plane in normal form: 


ax+by+cz+d =0 )1.26( 


For example, the normal form of the plane in Example 1.23 is 2x + 4y + 8z - 22 =0. 
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Plane containing three noncollinear points 


In 2-dimensional and 3-dimensional space, two points determine a line. Two points do 
not determine a plane in R®. In fact, three collinear points (i.e. all on the same line) do not 
determine a plane; an infinite number of planes would contain the line on which those three 
points lie. However, three noncollinear points do determine a plane. For if Q, R and S are 
noncollinear points in R3, then QR and QS ar are nonzero vectors which are not , parallel (by 
noncollinearity), and so their cross product QR x QS is perpendicular to both QF and QS . 
So QR and QS (and hence Q, R and S) lie in the plane through the point Q with normal 
vector n = QR Xx QS (see Figure 1.5.7). 


n= QR x QS 


Figure 1.5.7 Noncollinear points Q, R, S 


Example 1.24. Find the equation of the plane P containing the points (2,1,3), (1,-1,2) and 
(3,2,1. 

Solution: Let Q = (2,1,3), R = (1,-1,2) and S = (3,2, 1). Then for the vectors QR = )-1,-2,-1( 
and QS = (1,1,2), the plane P has a normal vector 


n = QÊ x QS = (-1,-2,-1) x (1,1,-2( = (5,-8,1( 


So using formula (1.25) with the point Q (we could also use R or S), the plane P consists of 
all points (x,y,z) such that: 


5()x- 2)-83(y- 1+ (2-3) =0 
or in normal form, 
5x -3y+z2—-10=0 


We mentioned earlier that skew lines in R lie on separate, parallel planes. So two skew 
lines do not determine a plane. But two (nonidentical) lines which either intersect or are 
parallel do determine a plane. In both cases, to find the equation of the plane that contains 
those two lines, simply pick from the two lines a total of three noncollinear points (i.e. one 
point from one line and two points from the other), then use the technique above, as in 
Example 1.24, to write the equation. We will leave examples of this as exercises for the 
reader. 
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Distance between a point and a plane 


The distance between a point in R3 and a plane is the length of the line segment from 
that point to the plane which is perpendicular to the plane. The following theorem gives a 
formula for that distance. 


Theorem 1.19. Let Q = (xo, 70,20) be a point in RF, and let P be a plane with normal form 
ax +by+cz +d =0 that does not contain Q. Then the distance D from Q to P is: 


|axo + by, + czo + d| 
Va + b2 + c2 


Proof: Let R = (x,y,z) be any point in the plane P (so that ax + by + cz +d = 0) and let 
r= RQ = (x) = X,Y = J,2, ~2). Then r ¥ 0 since Q does not lie in P. From the normal form 
equation for P, we know that n = (a,b,c) is a normal vector for P. Now, any plane divides 
RF into two disjoint parts. Assume that n points toward the side of P where the point Q 
is located. Place n so that its initial point is at R, and let 0 be the angle between r and 
n. Then 0° <0 < 90°, so cosÛ0 > 0. Thus, the distance D is cos0 |lr|l = |cos0| llr || (see Figure 
1.5.8). 


D (1.27) 


P 
Figure 1.5.8 
n-r 
By Theorem 1.6 in Section 1.3, we know that cos = NIE SO 
n-r n-r —- x%)+ b(yo — y)+ 
SRE a O O E 
nll llrll nll Va + b2 + c2 
_ lax +byo + cz, (ax + by +cz)| lax, + byo + c2o—-(-d)| _ lax, + by, + c20 + d| 
Va + b% + ce Va + b2 + ce Va + b2 + ce 


If n points away from the side of P where the point Q is located, then 90° < 0 < 180° and 
so cosÛ < 0. The distance D is then |cos0| llrll, and thus repeating the same argument as 
above still gives the same result. QED 


Example 1.25. Find the distance D from (2,4, -5) to the plane from Example 1.24. 
Solution: Recall that the plane is given by 5x - 3y +z -10 =0. So 
D= |5(2)-3(4)+ 1(-5)- 10| |-17|_ 117 


7 کک 
735 ۷35 1 + 8(2-) + 752 
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Line of intersection of two planes 


Note that two planes are parallel if they have normal vectors that 
are parallel, and the planes are perpendicular if their normal vectors 
are perpendicular. If two planes do intersect, they do so in a line (see 
Figure 1.5.9). Suppose that two planes P, and P, with normal vectors 
n, and n,, respectively, intersect in a line L. Since n, x n, L n,, then 
n, x n, is parallel to the plane P,. Likewise, n, x n, L n, means that 
n, x n, is also parallel to P,. Thus, n, x n, is parallel to the intersection 
of P, and P», i.e. n, x n, is parallel to L. Thus, we can write L in the following vector form: 


Figure 1.5.9 


L:r+t(n, xn), for -o0<t<oO (1.28) 


where r is any vector pointing to a point belonging to both planes. To find a point in both 
planes, find a common solution (x, y,z) to the two normal form equations of the planes. This 
can often be made easier by setting one of the coordinate variables to zero, which leaves you 
to solve two equations in just two unknowns. 


Example 1.26. Find the line of intersection L of the planes 5x -3y+z-10 =0 and 2x +4y-— 
F330, 

Solution: The plane 5x - 3y +z - 10 = 0 has normal vector n, = (5,—-3, 1) and the plane 2x + 
4y—-z +83 =0 has normal vector n, = (2,4,—-1). Since n, and n, are not scalar multiples, then 
the two planes are not parallel and hence will intersect. A point (x,y,z) on both planes will 
satisfy the following system of two equations in three unknowns: 


5x -3y+z2—-10 =0 
2x+4y—-z+ 3=0 


Set x = 0 (why is that a good choice?). Then the above equations are reduced to: 


-@3yر+‎ 2-10 =0 
4y-z+ 3=0 


The second equation gives z = 4y + 3, substituting that into the first equation gives y =1. 
Then z = 31, and so the point (0,7,31) is on L. Since n, x n, = (-1,7,26), then L is given by: 


r+t(n, x Nn,) = (0,7,31)+t(-1,7,26), for -o0<t<oo 
or in parametric form: 


x=-t, y=T+'t, z2=831+26t, for -o0<t<oo 


1.5 Lines and Planes 39 


Exercises 
A 


For Exercises 1-4, write the line L through the point P and parallel to the vector v in the 
following forms: (a) vector, (b) parametric, and (c) symmetric. 


1. P= (2,3,-2), v= (5,4,-3( 2. P= (3,-1,2), v= (2,8,1) 
3. P = (2,1,3), v= (1,0,1) 4. P = (0,0,0), v= (7,2,-10( 


For Exercises 5-6, write the line L through the points P, and P, in parametric form. 


5. P, = (1,-2,-83), P= (3,5,5) 6. P, = (4,1,5), P, = (-2,1,3) 


For Exercises '7-8, find the distance d from the point P to the line L. 
1. P=(1,-1,-1), L:ix=-2-2t,y=4t, 2= T+t 
8. P=(0,0,0), L:ix=3+2t, y=4+3t, 2= 5+ 4t 
For Exercises 9-10, find the point of intersection (if any) of the given lines. 
9. x=7+3s, y= -4-3s, 2= -71-5s and x=1+ 6t, y=2+t, 2 =3-1 
x-6 x-11 y-14 z+9 

10. = رy+83=‎ d چ‎ 

0 71 y+ z an 3 E 5 
For Exercises 11-12, write the normal form of the plane P containing the point Q and per- 
pendicular to the vector n. 


11. Q =(5,1,-2), n = (4, -4,3( 12. Q = (6,-2,0), n = (2,6,4) 

For Exercises 13-14, write the normal form of the plane containing the given points. 
13. (1,0,8), (1,2,-1), (6,1,6) 14. (-3,1,-3), (4,-4,3(, )0,0,1( 

15. Write the normal form of the plane containing the lines from Exercise 9. 

16. Write the normal form of the plane containing the lines from Exercise 10. 

For Exercises 17-18, find the distance D from the point Q to the plane P. 

17. Q =(4,1,2), P:3x—-y—-5z+8=0 18. Q =(0,2,0), P:-5x+2y-71z+1=0 
For Exercises 19-20, find the line of intersection (if any) of the given planes. 

19. x<+3y+22-6=0, 2x -y+2+2 =0 20. 3x+y-5z2=0,x+2y+2+4 =0 

B 


21. Find the point(s) of intersection (if any) of the line 0 


6 
= y+3 = z with the plane 
x+ 83y + 22-6 =0. (Hint: Put the equations of the line into the equation of the plane.) 
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1.6 Surfaces 


In the previous section we discussed planes in Euclidean space. A plane is an example of 
a surface, which we will define informally as the solution set of the equation F(x, y,z) = 0 
in RF, for some real-valued function F. For example, a plane given by ax + by +cz +d =0 
is the solution set of F(x, y,z) = 0 for the function F(x,y,z2) = ax + by + cz +d. Surfaces are 
2-dimensional. The plane is the simplest surface, since it is “flat”. In this section we will 
look at some surfaces that are more complex, the most important of which are the sphere 
and the cylinder. 


Definition 1.9. A sphere S is the set of all points (x,y,z) in R3 which are a fixed distance r 
(called the radius) from a fixed point Po = (xo, Yo,2,) (called the center of the sphere): 


S = {(e,y,2) : (£= (2 + )y = (2 + )2 - 2(7 = r ( )1.29( 
Using vector notation, this can be written in the equivalent form: 
S={x: |lx—xo|l =r} (1.30) 
where x = (x, y,Z) and xo = (Xo, yo,20) are vectors. 


Figure 1.6.1 illustrates the vectorial approach to spheres. 


2 


% 
(a) radius r, center (0, 0,0) (b) radius r, center (<0, 0,20) 


Figure 1.6.1 Spheres in RF 


Note in Figure 1.6.1(a) that the intersection of the sphere with the xy-plane is a circle 
of radius r (i.e. a great circle, given by x + y = r as a subset of R2). Similarly for the 
intersections with the xz-plane and the yz-plane. In general, a plane intersects a sphere 
either at a single point or in a circle. 


SSee O’NEILL for a deeper and more rigorous discussion of surfaces. 
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Example 1.27. Find the intersection of the sphere x% + y% + 2% = 169 with the plane z = 12. 


Solution: The sphere is centered at the origin and has radius 
13 = ۷169, so it does intersect the plane z = 12. Putting 
z = 12 into the equation of the sphere gives 
„2 + 2ر‎ + 12 = 9 
2 + ر‎ = 169 - 144 = 25 = 5 


which is a circle of radius 5 centered at (0,0, 12), parallel to 
the xy-plane (see Figure 1.6.2). Figure 1.6.2 


If the equation in formula (1.29) is multiplied out, we get an equation of the form: 
x2 + y2 +2 +ax+by+cz+d =0 (1.31) 


for some constants a, b, c and d. Conversely, an equation of this form may describe a sphere, 
which can be determined by completing the square for the x, y and z variables. 


Example 1.28. Is 2x2 + 2y + 22 - 8x + 4y ¬ 16z + 10 = 0 the equation of a sphere? 
Solution: Dividing both sides of the equation by 2 gives 


2+ ر‎ + 2-4 + 2y -82+ 5 =0 
(2 - 4x + 4) + (y+ 2y + 1) + (2 - 82+ 16) + 5-4-1-16 =0 
ی(‎ - 2( + )y + 1(7 + )z - 4) = 6 


which is a sphere of radius 4 centered at (2,—-1,4). 


Example 1.29. Find the points(s) of intersection (if any) of the sphere from Example 1.28 
and the line x =3+t, y=1+2t,2=3-t. 


Solution: Put the equations of the line into the equation of the sphere, which was (x — 2) + 
(y+ 1)7 + (z- 4) = 16, and solve for t: 
(3+t-2) + (1+2+1) + -غ-3(‎ 4) = 16 
(t+ 1) + غ2(‎ + 2) +(-t- 1) = 16 
6t +12t-10=0 


4 
The quadratic formula gives the solutions t= 1+ E Putting those two values into the 
equations of the line gives the following two points of intersection: 
4 8 4 1 4 8 4 
2+ — 1+1, 4-——| and |2-———,-1-— 4+ 
V6 ۷6 V6 V6 ۷6 V6 
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If two spheres intersect, they do so either at a single point or in a circle. 


Example 1.30. Find the intersection (if any) of the spheres x + y2 + 2% = 25 and x? + y+ (z— 
2) = 16. 


Solution: For any point (x,y,z) on both spheres, we see that 
2 + 2 +2 = 5 

16 = 2(۶ -2) + ر + 2 

16 - )z- 2(۶ = 25 - ہے‎ 


and‏ ,25-۶ = ر + 2× ج 

-)z2-2(, SO‏ 16 = ر + 2× ج 

> 4z -4=9 > z2 =4 

6 = ”(13/4) - 25 = ر + ج 


.'. The intersection is the circle x + y% = 21 of radius ا‎ x 3.8 centered at (0,0, #). 


The cylinders that we will consider are right circular cylinders. These are cylinders ob- 
tained by moving a line L along a circle C in RÎ in a way so that L is always perpendicular 
to the plane containing C. We will only consider the cases where the plane containing C is 
parallel to one of the three coordinate planes (see Figure 1.6.3). 


2 2 


(a) x + y2 =r, any z (b) x? +22 =r, any y (e) y2 +z =r“, any x 


Figure 1.6.3 Cylinders in RF 


For example, the equation of a cylinder whose base circle C lies in the x<y-plane and is 
centered at (a, b,0) and has radius r is 


(e - a) + ر)‎ - b( = r, (1.32( 


where the value of the z coordinate is unrestricted. Similar equations can be written when 
the base circle lies in one of the other coordinate planes. A plane intersects a right circular 
cylinder in a circle, ellipse, or one or two lines, depending on whether that plane is parallel, 
oblique®, or perpendicular, respectively, to the plane containing C. The intersection of a 
surface with a plane is called the trace of the surface. 


i.e. at an angle strictly between 0° and 90°. 
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The equations of spheres and cylinders are examples of second-degree equations in RF, i.e. 
equations of the form 


Ax + By +Cz22+Dxy+Exz+Fyz+Gx+Hy+Iz+J =0 (1.33( 


for some constants A, B, ..., J. Ifthe above equation is not that of a sphere, cylinder, plane, 
line or point, then the resulting surface is called a quadric surface. 


One type of quadric surface is the ellipsoid, given چ‎ 
by an equation of the form: 


++ =1 1.34 
a 2ط‎ cc e 
In the case where a = b = c, this is just a sphere. 
In general, an ellipsoid is egg-shaped (think of an 
ellipse rotated around its major axis). Its traces in 
the coordinate planes are ellipses. Figure 1.6.4 Ellipsoid 


Two other types of quadric surfaces are the hyperboloid of one sheet, given by an 
equation of the form: 


)1.35( ا 
C‏ 


E a =1 (1.36) 


x 


Figure 1.6.5 Hyperboloid of one sheet Figure 1.6.6 Hyperboloid of two sheets 
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For the hyperboloid of one sheet, the trace in any plane parallel to the xy-plane is an 
ellipse. The traces in the planes parallel to the xz- or yz-planes are hyperbolas (see Figure 
1.6.5), except for the special cases x = +a and y = +b; in those planes the traces are pairs of 


intersecting lines (see Exercise 8). 


For the hyperboloid of two sheets, the trace in any plane parallel to the xy- or xz-plane is 
a hyperbola (see Figure 1.6.6). There is no trace in the yz-plane. In any plane parallel to the 


x 
Figure 1.6.7 Paraboloid 


yz-plane for which |x| > |a|, the trace is an ellipse. 


The elliptic paraboloid is another type of quadric surface, 
whose equation has the form: 


x2 ر‎ 


3 + ا‎ (1.87) 
The traces in planes parallel to the xy-plane are ellipses, though 
in the xy-plane itself the trace is a single point. The traces in 
planes parallel to the xz- or yz-planes are parabolas. Figure 
1.6.7 shows the case where c > 0. When c < 0 the surface is 
turned downward. In the case where a = b, the surface is called 
a paraboloid of revolution, which is often used as a reflecting sur- 
face, e.g. in vehicle headlights." 


A more complicated quadric surface is the hyperbolic paraboloid, given by: 


(1.38) 


-10 


100 r” 


I 


10 10 


Figure 1.6.8 Hyperbolic paraboloid 


10 For a discussion of this see pp. 157-158 in HECHT. 
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The hyperbolic paraboloid can be tricky to draw; using graphing software on a computer 
can make it easier. For example, Figure 1.6.8 was created using the free Gnuplot package 
(see Appendix C). It shows the graph of the hyperbolic paraboloid z = y - x7, which is the 
special case where a = b = 1 and c = —-1in equation (1.38). The mesh lines on the surface are 
the traces in planes parallel to the coordinate planes. So we see that the traces in planes 
parallel to the xz-plane are parabolas pointing upward, while the traces in planes parallel 
to the yz-plane are parabolas pointing downward. Also, notice that the traces in planes 
parallel to the xy-plane are hyperbolas, though in the xy-plane itself the trace is a pair of 
intersecting lines through the origin. This is true in general when c < 0 in equation (1.38). 
When c > 0, the surface would be similar to that in Figure 1.6.8, only rotated 90° around 
the z-axis and the nature of the traces in planes parallel to the xz- or yz-planes would be 
reversed. 


The last type of quadric surface that we will consider is the 
elliptic cone, which has an equation of the form: 


xX y 2z 
ID 2 
a“ Db c 


=0 (1.39) 


e 


The traces in planes parallel to the xy-plane are ellipses, ex- 
cept in the xy-plane itself where the trace is a single point. 
The traces in planes parallel to the xz- or yz-planes are hyper- 
bolas, except in the xz- and yz-planes themselves where the 
traces are pairs of intersecting lines. 4 

Notice that every point on the elliptic cone is on a line which 
lies entirely on the surface; in Figure 1.6.9 these lines all go 
through the origin. This makes the elliptic cone an example of 
a ruled surface. The cylinder is also a ruled surface. 

What may not be as obvious is that both the hyperboloid of one sheet and the hyperbolic 
paraboloid are ruled surfaces. In fact, on both surfaces there are two lines through each 
point on the surface (see Exercises 11-12). Such surfaces are called doubly ruled surfaces, 
and the pairs of lines are called a regulus. 

It is clear that for each of the six types of quadric surfaces that we discussed, the surface 
can be translated away from the origin (e.g. by replacing x by (x—4x,)% in its equation). It can 
be proved" that every quadric surface can be translated and/or rotated so that its equation 
matches one of the six types that we described. For example, z = 2xy is a case of equation 
(1.33) with “mixed” variables, e.g. with D # 0 so that we get an xy term. This equation does 
not match any of the types we considered. However, by rotating the x- and y-axes by 45° in 
the xy-plane by means of the coordinate transformation x = (xy )/V2, y = (x+y V2, z2 = z!, 
then z = 2xy becomes the hyperbolic paraboloid z' = (x) — (y')% in the (x, y',2') coordinate 
system. That is, z = 2xy is a hyperbolic paraboloid as in equation (1.38), but rotated 45° in 
the xy-plane. 


Figure 1.6.9 Elliptic cone 


11See Ch. 7 in POGORELOV. 
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Exercises 
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A 


For Exercises 1-4, determine if the given equation describes a sphere. If so, find its radius 
and center. 


×2 + y+ 2-4 - 6y - 102+37 =0 2. x+y +2 + 2-2-82 +19 =0 
24 + 2y + 22 + 4x + 4y + 42-44 = 0 4. x2 + y-2 +12 + 2y - 42+ 82 =0 


Find the point(s) of intersection of the sphere (x - 3) + (y + 1) + (z - 3) = 9 and the line 
x= -1+2t, y= -2-t, 2= 3+. 


Find the intersection of the spheres 42 + ر‎ + 22 =9 and (x — 4) + ( y+ 2)2 + (2-42 =9. 


„. Find the intersection of the sphere x + y% +z = 9 and the cylinder x + y2 = 4. 


2 
2 y 22 


. Find the trace of the hyperboloid of one sheet &ş + وک ¬ و‎ = 1 in the plane x = a, and the 


a Cc 


trace in the plane y =D. 
Find the trace of the hyperbolic paraboloid م‎ . 2 = in the xy-plane. 


1. 
3. 


5. 


6. 


9. 
C 


10. It can be shown that any four noncoplanar points (i.e. points that do not lie in the same 


plane) determine a sphere." Find the equation of the sphere that passes through the 
points (0, 0,0), (0,0,2), (1,-4,3) and (0,-1,3). (Hint: Equation (1.31)) 


11. Show that the hyperboloid of one sheet is a doubly ruled surface, i.e. each point on 


the surface is on two lines lying entirely on the surface. (Hint: Write equation (1.35) as 


2 2 2 9 4 
3= - & =1- 2 , factor each side. Recall that two planes intersect in a line.) 
a c b 


12. Show that the hyperbolic paraboloid is a doubly ruled surface. (Hint: Exercise 11) 


and let S* be S without the “north pole” point (0,0, 2). Let 
(a,b,c) be an arbitrary point on S*. Then the line passing 
through (0,0, 2) and (a, b,c) intersects the xy-plane at some 
point (x, y,0), as in Figure 1.6.10. Find this point (x, y, 0) in 
terms of a, b and c. 

(Note: Every point in the xy-plane can be matched with a 
point on S*, and vice versa, in this manner. This method is 
called stereographic projection, which essentially identifies 
all of RÛ with a “punctured” sphere.) 


Figure 1.6.10 


13. Let S be the sphere with radius 1 centered at (0,0, 1), 


12See WELCHONS and KRICKENBERGER, p. 160, for a proof. 
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1.7 Curvilinear Coordinates 


The Cartesian coordinates of a point (x,y,z) are determined by 2 (%,¥,2( 
following straight paths starting from the origin: first along the 
x-axis, then parallel to the y-axis, then parallel to the z-axis, as 
in Figure 1.7.1. In curvilinear coordinate systems, these paths can 
be curved. The two types of curvilinear coordinates which we will 
consider are cylindrical and spherical coordinates. Instead of ref- y 
erencing a point in terms of sides of a rectangular parallelepiped, 
as with Cartesian coordinates, we will think of the point as ly- 
ing on a cylinder or sphere. Cylindrical coordinates are often used when there is symmetry 
around the z-axis; spherical coordinates are useful when there is symmetry about the origin. 

Let P = (x,y,z) be a point in Cartesian coordinates in R3, and let Po = (x, y,0) be the 
projection of P upon the x<y-plane. Treating (x,y) as a point in R/, let (r,0) be its polar 
coordinates (see Figure 1.7.2). Let p be the length of the line segment from the origin to P, 
and let p be the angle between that line segment and the positive z-axis (see Figure 1.7.3). 
Q is called the zenith angle. Then the cylindrical coordinates (r,0,z) and the spherical 
coordinates (0,0, p) of P(x, y,z) are defined as follows: 


Figure 1.7.1 


Cylindrical coordinates (r,0,z): P(%,y,2) 


x = rcosO ٣ = ۹/2 + 2ر‎ 
y= rsinê 0 = tan" (2) 


€ ي € 


Po(x,y,0) 


where O<sOsr if y>0 and r<0<2r if y<0 


Figure 1.7.2 
Cylindrical coordinates 


Spherical coordinates (0,0, Q%): P(x,y,2) 
x = psinQ cos م‎ = ٣/2 + 2ے + 2ر‎ 
4 
y = psinQp sin@ 0 = tan ` (2) کے‎ 
z2 = pcosp عب )و = م‎ ( 
+ ر‎ +2 2 y Po(,y,0) 


where O<sOsr if y>0 and r<0<2r if y<0 Figure 1.7.3 
Spherical coordinates 

Both 0 and Q are measured in radians. Note that r > 0, O<sÛ0< 2r, p> 0 and O0 < QP <. 
Also, 0 is undefined when (x, y) = (0,0), and Q is undefined when (x,y,z) = (0,0,0). 


13This “standard” definition of spherical coordinates used by mathematicians results in a left-handed system. 
For this reason, physicists usually switch the definitions of 0 and ¢ to make (0,0,4) a right-handed system. 
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Example 1.31. Convert the point (-2,—2,1) from Cartesian coordinates to (a) cylindrical 
and (b) spherical coordinates. 


Solution: (a) r = V(-2)* + (-2)* = 272, 0 = tan" (Z3) = tan 7(1) = ŠZ, since y = -2 <0. 
.'. (0,2) = (272,8, 1( 


(b) p = V(-2) + (-2)2 + 12 = (9 = 8, p= cos (3) x 1.23 radians. 
.'. )0,0,( = )8, 52 ,1.23( 


For cylindrical coordinates (r,0,z), and constants ro, 0, and zo, we see from Figure 1.7.4 
that the surface r = r, is a cylinder of radius r, centered along the z-axis, the surface 0 = 0, 
is a half-plane emanating from the z-axis, and the surface z = z2, is a plane parallel to the 
xy-plane. 


XX 


(e) z2 = 20 


Figure 1.7.4 Cylindrical coordinate surfaces 


For spherical coordinates (0,0,p), and constants 00, 0, and $o, we see from Figure 1.7.5 
that the surface p = p, is a sphere of radius p0, centered at the origin, the surface 0 = 0, is a 
half-plane emanating from the z-axis, and the surface = $, is a circular cone whose vertex 
is at the origin. 


٤ 2 


(b) 0 = 00 


Figure 1.7.5 Spherical coordinate surfaces 


Figures 1.7.4(a) and 1.7.5(a) show how these coordinate systems got their names. 
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Sometimes the equation of a surface in Cartesian coordinates can be transformed into a 
simpler equation in some other coordinate system, as in the following example. 


Example 1.32. Write the equation of the cylinder x + y% = 4 in cylindrical coordinates. 
Solution: Since r = ¥ x + y2, then the equation in cylindrical coordinates is r = 2. 


Using spherical coordinates to write the equation of a sphere does not necessarily make 
the equation simpler, if the sphere is not centered at the origin. 


Example 1.33. Write the equation (x — 2) + (y - 1) + 2% = 9 in spherical coordinates. 
Solution: Multiplying the equation out gives 


x2 + y+ 2-4-2) +5 =9 , So we get 
0 -4psinp cosO - 2psinQ sin(—-4=0 , or 
p*-2sinp(2cos0 -sin0)p =4 =0 


after combining terms. Note that this actually makes it more difficult to figure out what the 
surface is, as opposed to the Cartesian equation where you could immediately identify the 
surface as a sphere of radius 3 centered at (2, 1,0). 


Example 1.34. Describe the surface given by 0 = z in cylindrical coordinates. 


Solution: This surface is called a helicoid. As the (vertical) z coordinate increases, so does 
the angle 0, while the radius r is unrestricted. So this sweeps out a (ruled!) surface shaped 
like a spiral staircase, where the spiral has an infinite radius. Figure 1.7.6 shows a section 
of this surface restricted to 0 = z <= 47 and 0 sr <2. 


Figure 1.7.6 Helicoid 0 = z 
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Exercises 
A 

For Exercises 1-4, find the (a) cylindrical and (b) spherical coordinates of the point whose 
Cartesian coordinates are given. 


1. (2,2⁄8,-1( 2. )-5,5,6( 3. (/21,-7⁄7,0( 4. (0,2,2) 

For Exercises 5-7, write the given equation in (a) cylindrical and (b) spherical coordinates. 
5. 2 + ر + 2× .6 5 = ع + ر‎ = 2y 1 ×2 + y2 + 92 = 6 

B 


8. Describe the intersection of the surfaces whose equations in spherical coordinates are 
0= 3 and p= 7. 


9. Show that for a # 0, the equation p = 2asinQ cos in spherical coordinates describes a 
sphere centered at (a, 0,0) with radius |a|. 


C 


10. Let P = (a,0,p) be a point in spherical coordinates, with a > 0 and 0 < p <r. Then P 
lies on the sphere p = a. Since 0 < Q < r, the line segment from the origin to P can be 
extended to intersect the cylinder given by r = a (in cylindrical coordinates). Find the 
cylindrical coordinates of that point of intersection. 


11. Let P, and P,„ be points whose spherical coordinates are (0,,0,,%,) and (02,02, Pp»), respec- 
tively. Let v, be the vector from the origin to P,, and let v, be the vector from the origin 
to P2. For the angle y between v, and vs, show that 


cosy = cos, cos Ps» + sin Q, sin QP, cos(02 —0,). 


This formula is used in electrodynamics to prove the addition theorem for spherical har- 
monics, which provides a general expression for the electrostatic potential at a point due 
to a unit charge. See pp. 100-102 in JACKSON. 


12. Show that the distance d between the points P, and P, with cylindrical coordinates 
(rı,01ı,2,) and (r2,02,z»), respectively, is 


d= r2 +r2- rr c0s(0, = 0,)+ (2= 21). 


13. Show that the distance d between the points P, and P, with spherical coordinates 
(01ı,01ı,%ı) and (02,0, P2), respectively, is 


d= 2م‎ + p2? - 2p, pa[sinQ, sin Qs» cos(0, — 0,) + cosh, cos pa]. 
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1.8 Vector-Valued Functions 


Now that we are familiar with vectors and their operations, we can begin discussing func- 
tions whose values are vectors. 


Definition 1.10. A vector-valued function of a real variable is a rule that associates a 
vector f(t) with a real number t, where t is in some subset D of R' (called the domain of £. 
We write f: D —~ RF to denote that f is a mapping of D into RF. 


For example, f(t) = ti + tj + tk is a vector-valued function in RF, defined for all real num- 
bers t. We would write f: R ~ R3. At t = 1 the value of the function is the vector i + j + k, 
which in Cartesian coordinates has the terminal point (1, 1, 1). 

A vector-valued function of a real variable can be written in component form as 


f(t) = fı(t)i + fa(t)j + f3(t)k 


or in the form 


f(t) بک‎ (f(t), f(b), f(t) 


for some real-valued functions f(t), f2(t), f(t), called the component functions of f. The first 
form is often used when emphasizing that f(t) is a vector, and the second form is useful 
when considering just the terminal points of the vectors. By identifying vectors with their 
terminal points, a curve in space can be written as a vector-valued function. 


42z 
Example 1.35. Define f:R — R® by f(t) = (cost,sint,t). 
This is the equation of a helix (see Figure 1.8.1). As the value of 
t increases, the terminal points of f(t) trace out a curve spiraling 
upward. For each t, the x- and y-coordinates of f(t) are x = cost f2r) 4 
and y = sint, So 

x2 + 2 = cost + sin t =1. 0) 

XX 

Thus, the curve lies on the surface of the right circular cylinder Figure 1.8.1 


x2 + ر‎ =1. 


It may help to think of vector-valued functions of a real variable in RF as a generalization 
of the parametric functions in R which you learned about in single-variable calculus. Much 
of the theory of real-valued functions of a single real variable can be applied to vector-valued 
functions of a real variable. Since each of the three component functions are real-valued, it 
will sometimes be the case that results from single-variable calculus can simply be applied 
to each of the component functions to yield a similar result for the vector-valued function. 
However, there are times when such generalizations do not hold (see Exercise 13). The 
concept of a limit, though, can be extended naturally to vector-valued functions, as in the 
following definition. 


52 CHAPTER 1. VECTORS IN EUCLIDEAN SPACE 


Definition 1.11. Let f(t) be a vector-valued function, let a be a real number and let c be a 
vector. Then we say that the limit of f(t) as t approaches a equals ce, written as limf(t) =€, 
¬ 


if lim If(E)— ell = 0. If f(E) = (f(t), fat), fs()), then 


limf() = (lim #,(), lim f(4), ima f()) 
t¬a t¬a t¬a t¬>a 
provided that all three limits on the right side exist. 


The above definition shows that continuity and the derivative of vector-valued functions 
can also be defined in terms of its component functions. 


Definition 1.12. Let f(t) = (f(t), f(t), fs(t)) be a vector-valued function, and let a be a real 
number in its domain. Then f(t) is continuous at a if limf(t) = f(a). Equivalently, f(t) is 
¬ 


continuous at a if and only if f(t), f2(t), and f(t) are continuous at a. 


The derivative of f(t) at a, denoted by f(a) or 3 is the limit 


ou f(a + h)— f(a) 
h¬0 h 


if that limit exists. Equivalently, f(a) = (f,'(a), f, (a), fs (a)), if the component derivatives 
exist. We say that f(t) is differentiable at a if f(a) exists. 


Recall that the derivative of a real-valued function of a single variable is a real number, 
representing the slope of the tangent line to the graph of the function at a point. Similarly, 
the derivative of a vector-valued function is a tangent vector to the curve in space which 
the function represents, and it lies on the tangent line to the curve (see Figure 1.8.2). 


42 


ng f) ( 
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Figure 1.8.2 Tangent vector f(a) and tangent line L = f(a) + sf'(a) 


Example 1.36. Let f(t) = (cost,sint,t). Then f'(t) = (-sint,cost, 1) for all t. The tangent line 
L to the curve at f(2r) = (1,0,27) is L = f(2) + sf'(2r) = (1,0,27) + s(0,1, 1), or in parametric 
form: x =1, y=s,z2=2r +8 for o00 <8 < O0. 
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A scalar function is a real-valued function. Note that if u(t) is a scalar function and 
f(t) is a vector-valued function, then their product, defined by (uf)(t) = u(t)f(t) for all t, is a 
vector-valued function (since the product of a scalar with a vector is a vector). 

The basic properties of derivatives of vector-valued functions are summarized in the fol- 
lowing theorem. 


Theorem 1.20. Let f(t) and g(t) be differentiable vector-valued functions, let u(t) be a 
differentiable scalar function, let A be a scalar, and let ce be a constant vector. Then 
(a) “) =0 
df 
(b) r 
d 
(c) < frg= ع س‎ 
0 d 
( ۰ در‎ 
گر‎ 
(e) e 0= er e 
d 
0 E-g= gr f.8 
(g) (fx g8) = +× 


Proof: The proofs of parts (a)-(e) follow easily by differentiating the component functions 
and using the rules for derivatives from single-variable calculus. We will prove part (P, 
and leave the proof of part (g) as an exercise for the reader. 


(© Write f(t) = (f(t), fa(t), fa(t)) and g(t) = (g£1(t),g»(t),g3(t)), where the component functions 
f(t), fa(t), f(t), £1(t), g£2(t), g(t) are all differentiable real-valued functions. Then 


d d 
f ۰ g)(( = D810) + f(D) g(t) + f3(t)gs()) 


I) 4 e ا‎ 


e fı ا‎ : f 


d 1 d 2 3 
AE ا‎ ٠ (gı), g0), £0) 


d d 
, ,غ‎ 83 


7 )9( 


+ (FCD, f(D, f()° 


= ع“‎ + f(t) £ for all t. QED 
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Example 1.37. Suppose f(t) is differentiable. Find the derivative of || f(£)||. 
Solution: Since |lf(t)|l is a real-valued function of t, then by the Chain Rule for real-valued 


d d 
functions, we know that FOI = 2| fC) 7 IIfCEIN. 


But |lf()|l2 = f(t)- f(t), so IFO] = (RDN. Hence, we have 


2ll f()|| IFC) = (RO-) = f(t)‘ f(t) + f(t)°f'(t) by Theorem 1.20(), so 


= 2f(H°f(E) , so if II f(E)|| z O then 
f(t) f(t) 


d 
—|l ft ت‎ 
ي‎ 0 9 


d 
We know that |lf(L)|| is constant if and only if g0 = 0 for all t. Also, f(t) L f(t) if and 
only if f'(t)- f(t) = 0. Thus, the above example shows this important fact: 


If |lf(E)|l| z 0, then |l f(t)|| is constant if and only if f(t) L f(t) for all t. 


This means that if a curve lies completely on a sphere (or circle) centered at the origin, then 
the tangent vector f(t) is always perpendicular to the position vector f(t). 


cost sint -at 
V1+a2t# V1+a2t# V1+ at 
Figure 1.8.3 shows the graph of the curve when a = 0.2. In the exercises, the reader will be 


asked to show that this curve lies on the sphere x + y +z = 1 and to verify directly that 
f(t)‘ f(t) = 0 for all t. 


Example 1.38. The spherical spiral f(t) = ( J for a #0. 


Figure 1.8.3 Spherical spiral with a = 0.2 
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Just as in single-variable calculus, higher-order derivatives of vector-valued functions are 
obtained by repeatedly differentiating the (first) derivative of the function: 


d"f dd d”-1f 
dtr ا‎ 
We can use vector-valued functions to represent physical quantities, such as velocity, ac- 
celeration, force, momentum, etc. For example, let the real variable t represent time elapsed 
from some initial time (t = 0), and suppose that an object of constant mass m is subjected 
to some force so that it moves in space, with its position (x,y,z) at time t a function of 
t. That is, x = x(t), y = y(t), z = z(t) for some real-valued functions x(t), y(t), z(t). Call 
r(t) = (x(t), y(t), z(t)) the position vector of the object. We can define various physical quan- 
tities associated with the object as follows: 


f(t) = O, f(t) = e ) (for n = 2,3,4,...) 


position: r(t) = (x(t), y(t),2(t)) 
velocity: v(t) = F(t) = r'(t) = 0 


= (4(0, y'(t),z' (1)) 


d 
acceleration: a(t) = v(t) = v(t) = 


dr 


= F(t)=Fr ()= wg 


= (x(t), y"(t),2" ))غ(‎ 


momentum: p(t) = mv(t) 
d 
force: F(t) = p(t) = p(t) = 7 (Newton’s Second Law of Motion) 


The magnitude || v(t)|| of the velocity vector is called the speed of the object. Note that since 
the mass m is a constant, the force equation becomes the familiar F(t) = ma(t). 


Example 1.39. Let r(t) = (5cost,3sint,4sint) be the position vector of an object at time t > 0. 
Find its (a) velocity and (b) acceleration vectors. 

Solution: (a) v(t) = F(t) = (-5sint, 3cost,4cost) 

(b) a(t) = v(t) = (-Šcost,-3sint,—4sint) 

Note that |lr(t)|l = V25cos2t + 25sint = 5 for all t, so by Example 1.37 we know that r(t)- 
r(t) = 0 for all t (which we can verify from part (a)). In fact, |lv(t)|l = 5 for all t also. And not 
only does r(t) lie on the sphere of radius 5 centered at the origin, but perhaps not so obvious 
is that it lies completely within a circle of radius 5 centered at the origin. Also, note that 
a(t) = —r(t). It turns out (see Exercise 16) that whenever an object moves in a circle with 
constant speed, the acceleration vector will point in the opposite direction of the position 
vector (i.e. towards the center of the circle). 


14We will often use the older dot notation for derivatives when physics is involved. 
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Recall from Section 1.5 that ifr,, r, are position vectors to distinct points then r, +t(rg—Fr,) 
represents a line through those two points as t varies over all real numbers. That vector 
sum can be written as (1 — t)r, + tr. So the function l(t) = (1 - t)r, + tr, is a line through 
the terminal points of rı and r,, and when ft is restricted to the interval [0, 1] it is the line 
segment between the points, with 1(0) =r, and 1(1) =r. 

In general, a function of the form f(t) = (at + bD,,aat + b,,at + b4) represents a line in R3. A 
function of the form f(t) = (at + bt + c,,agt2 + bat + cg,agt2 + bgt + c;) represents a (possibly 
degenerate) parabola in RÎ. 


Example 1.40. Bézier curves are used in Computer Aided Design (CAD) to approximate 
the shape of a polygonal path in space (called the Bézier polygon or control polygon). For 
instance, given three points (or position vectors) bo, Dı, Db» in RF, define 


b(t) = (1 t)b, + tb, 
b(t) = (1-t)b, + tb» 
b(t) = (1- t)bj(t) + tb(t) 
= (1- tb, + 2t(1- t)b, + tb, 


for all real t. For t in the interval [0, 1], we see that b(t) is the line segment between b, and 
b,, and b(t) is the line segment between b, and b,. The function b(t) is the Bézier curve 
for the points bo, bı, b,. Note from the last formula that the curve is a parabola that goes 
through bo (when t = 0) and b„ (when t = 1). 

As an example, let bı = (0,0,0), b, = (1,2,3), and b„ = (4,5,2). Then the explicit formula for 
the Bézier curve is b(t) = (2t + 21,4t +t?,6t— 4t), as shown in Figure 1.8.4, where the line 
segments are bj(t) and b(t), and the curve is b(t). 


)1,2,3( 


Figure 1.8.4 Bézier curve approximation for three points 
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In general, the polygonal path determined by " > 3 noncollinear points in RF can be used 
to define the Bézier curve recursively by a process called repeated linear interpolation. This 
curve will be a vector-valued function whose components are polynomials of degree n —1, 
and its formula is given by de Casteljau’s algorithm.5 In the exercises, the reader will be 
given the algorithm for the case of n = 4 points and asked to write the explicit formula for 
the Bézier curve for the four points shown in Figure 1.8.5. 


r 


7 


Figure 1.8.5 Bézier curve approximation for four points 


Exercises 
A 
For Exercises 1-4, calculate f'(t) and find the tangent line at f(0). 
1. f= (t+1, + 1,3 +1) 2. f(t) = (e +1,e%* +1,e +1) 
3. f(t) = (cos2t,sin2t,t) 4. f(t) = (sin 2t, 2 sin t,2 cost) 


For Exercises 5-6, find the velocity v(t) and acceleration a(t) of an object with the given 
position vector r(t). 


5. r(t) = (t,t —sint, 1— cost) 6. r(t) = (3cost,2sint, 1) 
B 
cost sint -at 
1. Let f(1) = | n _ xwu -_| witha 20. 
(rE V1 + a2t2 e) 


(a) Show that ||lf(t)|| = 1 for all t. 
(b) Show directly that f(t) f(t) = 0 for all t. 


8. If f(t) = 0 for all t in some interval (a,b), show that f(t) is a constant vector in (a,b). 


15See pp. 27-30 in FARIN. 
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9. For a constant vector e # 0, the function f(t) = te represents a line parallel to c. 
(a) What kind of curve does g(t) = te represent? Explain. 
(b) What kind of curve does h(t) = ec represent? Explain. 
(c) Compare f(0) and g'(0). Given your answer to part (a), how do you explain the 
difference in the two derivatives? 
d df df 
10. Show that f x 7) E 


11. Let a particle of (constant) mass m have position vector r(t), velocity v(t), acceleration 
a(t) and momentum p(t) at time t. The angular momentum L(t) of the particle with 
respect to the origin at time t is defined as L(t) = r(t) x p(t). If F(t) is the force acting on 
the particle at time t, then define the torque N(t) acting on the particle with respect to 
the origin as N(t) = r(t) x F(t). Show that L'(t) = N(t). 


d df dg dh 
12. Show that (f(g x h) = (gx h) + f × hn) + f (e × ا‎ 


13. The Mean Value Theorem does not hold for vector-valued functions: Show that for f(t) = 


(cost,sint,t), there is no t in the interval (0, 2r) such that 
f(2r)— f(0) 
f) 

0 2r—0 


C 


14. The Bézier curve b(t) for four noncollinear points bo, bı, D,, b; in RF is defined by the 
following algorithm (going from the left column to the right): 


b(t) = (1—-t)b, +tbı, b(t) = (1-t)b(t) + tb(t) b(t) = (1 - t)b(t) + tb(t) 
b(t)= (1—-t)b, +tb, b(t) = (1 t)b}(t) + tb}(t) 
b}(t) = (1-t)b» + tbs 
(a) Show that b(t) = (1- t)3b, + 3(1 t)b, + 3t%(1 - t)b, + t3b,. 
(b) Write the explicit formula (as in Example 1.40) for the Bézier curve for the points 
bo = (0,0,0), b, = (0,1, 1), b = (2,3,0), bs = (4,5,2). 


15. Let r(t) be the position vector for a particle moving in R3. Show that 


(x (v x r)) = |lrllîa + (r-v)v — (| vl +r-a)r. 


16. Let r(t) be the position vector in RF for a particle that moves with constant speed c > 0 
in a circle of radius a > 0 in the xy-plane. Show that a(t) points in the opposite direction 
as r(t) for all t. (Hint: Use Example 1.37 to show that r(t) L v(t) and a(t) L v(t), and hence 
a(t) || r(t).) 


17. Prove Theorem 1.20(g). 
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1.9 Arc Length 


Let r(t) = (x(t), y(t), z(t)) be the position vector of an object moving in R3. Since |l v(t)|l is the 
speed of the object at time f, it seems natural to define the distance s traveled by the object 
from time t = a to t = b as the definite integral 


b b 
= | lvl at = | \/ x(t) + y'(t)2 + z(t) dt, (1.40) 


which is analogous to the case from single-variable calculus for parametric functions in RZ. 
This is indeed how we will define the distance traveled and, in general, the arc length of a 
curve in RÎ. 


Definition 1.13. Let f(t) = (x(t), y(t),z(t)) be a curve in R3 whose domain includes the inter- 
val [a,b]. Suppose that in the interval (a, b) the first derivative of each component function 
x(t), y(t) and z(t) exists and is continuous, and that no section of the curve is repeated. Then 
the arc length L of the curve from tf = a to t = b is 


b b 
1= | at = | (2 + y(t) + 2(2 dt (1.41) 


A real-valued function whose first derivative is continuous is called continuously differ- 
entiable (or a €` function), and a function whose derivatives of all orders are continuous 
is called smooth (or a 6® function). All the functions we will consider will be smooth. A 
smooth curve f(t) is one whose derivative f(t) is never the zero vector and whose component 
functions are all smooth. 

Note that we did not prove that the formula in the above definition actually gives the 
length of a section of a curve. A rigorous proof requires dealing with some subtleties, nor- 
mally glossed over in calculus texts, which are beyond the scope of this book." 


Example 1.41. Find the length L of the helix f(t) = (cost, sint,t) from t = 0 to t = 27. 
Solution: By formula (1.41), we have 


2r 2r 2r 
1= | V(- sind + (cos + at = | Vsin2t+ cost + 1t = | V2dt 
0 0 0 
= V2(27 - 0) = 227 


Similar to the case in R, if there are values of t in the interval [a, b] where the derivative 
of a component function is not continuous then it is often possible to partition [a, b| into 
subintervals where all the component functions are continuously differentiable (except at 
the endpoints, which can be ignored). The sum of the arc lengths over the subintervals will 
be the arc length over [a,b]. 


16In particular, Duhamel’s principle is needed. See the proof in TAYLOR and MANN, § 14.2 and § 18.2. 
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Notice that the curve traced out by the function f(t) = (cost,sint,t) from Example 1.41 is 
also traced out by the function g(t) = (cos 2t,sin 2t, 2t). For example, over the interval [0,7|, 
g(t) traces out the same section of the curve as f(t) does over the interval [0,27]. Intuitively, 
this says that g(t) traces the curve twice as fast as f(t). This makes sense since, viewing the 
functions as position vectors and their derivatives as velocity vectors, the speeds of f(t) and 
g(t) are |lf'()|l = V2 and |lg'(|| = 2۷2, respectively. We say that g(t) and f(t) are different 
parametrizations of the same curve. 


Definition 1.14. Let C be a smooth curve in R represented by a function f(t) defined on an 
interval [a,b], and let a: [c,d] ¬ [a,b| be a smooth one-to-one mapping of an interval [c,d] 
onto [a,b]. Then the function g : [c,d] + R3 defined by g(s) = f(a(s)) is a parametrization of 
C with parameter s. If a is strictly increasing on [c,d] then we say that g(s) is equivalent 
to f(t). 


8 t f(t) 
[c,d] ]a,b[ R3 


g(s) = f(a(s)) = f(t) 


Note that the differentiability of g(s) follows from a version of the Chain Rule for vector- 
valued functions (the proof is left as an exercise): 


Theorem 1.21. Chain Rule: If f(t) is a differentiable vector-valued function of t, and t = 
«a(s) is a differentiable scalar function of s, then f(s) = f(a(s)) is a differentiable vector-valued 
function of s, and 

df dfdt 

ds dtds 
for any s where the composite function f(a(s)) is defined. 


(1.42) 


Example 1.42. The following are all equivalent parametrizations of the same curve: 


f(t) = (cost,sint,t) for t in [0,27] 
g(s) = (cos2s,sin2s,2s) for s in [0,r| 


h(s) = (cos2rs,sin2rs,2rs) for s in [0,1] 


To see that g(s) is equivalent to f(t), define a :[0,r] — [0,27] by a(s) = 2s. Then «a is smooth, 
one-to-one, maps [0,7] onto [0,27], and is strictly increasing (since a'(s) = 2 > 0 for all s). 
Likewise, defining «a :[0,1| — [0,27] by a(s) = 2rrs shows that h(s) is equivalent to f(t). 
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A curve can have many parametrizations, with different speeds, so which one is the best 
to use? In some situations the arc length parametrization can be useful. The idea behind 
this is to replace the parameter t, for any given smooth parametrization f(t) defined on [a,b], 
by the parameter s given by 


t 
ّ =0 = | If (Il du. (1.43) 


In terms of motion along a curve, s is the distance traveled along the curve after time t 
has elapsed. So the new parameter will be distance instead of time. There is a natural 
correspondence between s and t: from a starting point on the curve, the distance traveled 
along the curve (in one direction) is uniquely determined by the amount of time elapsed, and 
vice versa. 

Since s is the arc length of the curve over the interval [a,t| for each t in [a,b], then it is a 
function of t. By the Fundamental Theorem of Calculus, its derivative is 


t 
s')t(= > | If Cu)|l du = |lf'(D|l for all t in [a,b]. 


Since f(t) is smooth, then |lf’()|l| > 0 for all t in [a,b]. Thus s(t) > 0 and hence s(t) is strictly 


increasing on the interval [a,b]. Recall that this means that s is a one-to-one mapping of the 
interval [a, b] onto the interval [s(a),s(b)]. But we see that 


a b 
= | If(wlldu =0 and (= | If (u)|l du = L = arc length from t= a tot = b 
a a 


So the function s : [a,b] — [0,L] is a one-to-one, differentiable a(s() 
mapping onto the interval [0,L]. From single-variable calculus, 
we know that this means that there exists an inverse function [0,Ll [a,b] 
«a :[0,L| — [a,b] that is differentiable and the inverse of s : [a, |] —~ د‎ 


[0,L]. That is, for each t in [a,b] there is a unique s in [0,L| such 
that s = s(t) and t = a(s). And we know that the derivative of «a is Figure 1.9.1 t= a(s) 


آ ےر 
|))ئ(lf'(a| s')a)s))‏ 


So define the arc length parametrization f: [0,L] —~ RF by 


a'(s(= 


f(s) = f(a(s)) for all s in [0,LJ]. 
Then f(s) is smooth, by the Chain Rule. In fact, f(s) has unit speed: 


f(s) = f'(a(s))a'(s) by the Chain Rule, so 
1 
=f د‎ 
e lf'(a(sDll 


IIf'(s)|l = 1 for all s in [0,L]. 


So the arc length parametrization traverses the curve at a “normal” rate. 
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In practice, parametrizing a curve f(t) by arc length requires you to evaluate the integral 
s = JIlf(u)|l du in some closed form (as a function of ¢) so that you could then solve for ¢ in 
terms of s. If that can be done, you would then substitute the expression for t in terms of s 
(which we called «a(s)) into the formula for f(t) to get f(s). 


Example 1.43. Parametrize the helix f(t) = (cost,sint,t), for t in [0,27], by arc length. 
Solution: By Example 1.41 and formula (1.43), we have 


t 
| LOL V2du = V2t for all t in [0, 21. 
0 0 


1 8 
So we can solve for t in terms of s: t= qa(s)= —. 


۷2 


8 ) for all s in [0,2۷27]. Note that |lf'(s)|l = 1. 


'. f)s(= (cos ,وھ‎ 
۷2 V2 2 
Arc length plays an important role when discussing curvature and moving frame fields, 
in the field of mathematics known as differential geometry."" The methods involve using 
an arc length parametrization, which often leads to an integral that is either difficult or 
impossible to evaluate in a simple closed form. The simple integral in Example 1.43 is 
the exception, not the norm. In general, arc length parametrizations are more useful for 
theoretical purposes than for practical computations." Curvature and moving frame fields 
can be defined without using arc length, which makes their computation much easier, and 
these definitions can be shown to be equivalent to those using arc length. We will leave this 
to the exercises. 
The arc length for curves given in other coordinate systems can also be calculated: 


Theorem 1.22. Suppose that r = r(t), 0 = 0(t) and z = z(t) are the cylindrical coordinates of 
a curve f(t), for t in [a,b]. Then the arc length L of the curve over [a,b] is 


b 
ا‎ Vr'(t)2 + r(t)201(t)2 + 21(t)2 dt (1.44) 


Proof: The Cartesian coordinates (x(t), y(t), z(t)) of a point on the curve are given by 
x(t) = r(t)cos0(t), y(t)=r(tsin0(t), z(t) = z(t) 
so differentiating the above expressions for x(t) and y(t) with respect to t gives 
x(t) =r (t)cos0(t)— r(t)0'(tsin0(t), y(t) = r'(t)sin0(t) + r(t)0'(t)cos0(t) 
1TSee O’NEILL for an introduction to elementary differential geometry. 
18For example, the usual parametrizations of Bézier curves, which we discussed in Section 1.8, are polynomial 


functions in R3. This makes their computation relatively simple, which, in CAD, is desirable. But their arc 
length parametrizations are not only not polynomials, they are in fact usually impossible to calculate at all. 
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and so 


x ($) + y'($% = (r'(cos0(t)— r(t)0'(t)sin0(¢))® + (r'(t)sin 0(t) + r(t)0'(t)cos0(t))* 
= r'(t)(cos0 + sin 0) + r(t)0'(t)(cos 0 + sin 0) 
- 2r'(t)r(t)0'(t)cos0 sinO + 2r'(t)r(t)0'(t)cos0 sin@ 
= r'(t) +r()0'(t) , and so 


b 
L =| \/ x(t) + y(t) + z(t) dt 


b 
= | Vr'(t) + r(t)20'(t) + 2'(t)2 dt QED 


Example 1.44. Find the arc length L of the curve whose cylindrical coordinates are r = e’, 


0=t and z =e’, for t over the interval [0, 1]. 
Solution: Since r'(t) = e“, 0'(t) = 1 and z(t) = e’, then 
1 
E Vr'(t)2 + r(t)20(t)2 + z1(t)2 dt 
0 
1 
| Vet + et(1)+ edt 
0 


1 
| e’ V3dt = V3(e-1) 
0 


Exercises 
A 


For Exercises 1-3, calculate the arc length of f(t) over the given interval. 
1. f(t) = (3cos2t,3sin 2t, 3t) on [0, 1/2] 

2. f(t) = ((t + Decost,(t% + 1)sint, 2V2) on [0,1] 

3. f(t) = (2cos3t,2sin3t, 232) on [0,11] 

4. Parametrize the curve from Exercise 1 by arc length. 

5. Parametrize the curve from Exercise 3 by arc length. 

B 

6. Let f(t) be a differentiable curve such that f(t) z 0 for all t. Show that 


d f(t) | 
dtl lfDOlI If) 
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Exercises '7-9 develop the moving frame field T, N, B at a point on a curve. 


71. Let f(t) be a smooth curve such that f(t) ¥ 0 for all t. Then we can define the unit tangent 
vector T by 
f(t) 

T()= 
IFC 


Show that 
f(t) x (f(t) x f'()) 


T(t) = 
8 EOE 


8. Continuing Exercise 7, assume that f(t) and f”(t) are not parallel. Then T'(t) ¥ 0 so we 
can define the unit principal normal vector N by 


T(t) 
N() = 3 
0= TT] 


Show that 
_ f(t) x (f(t) x f'(E)) 


N(t) = : 
IfTCEIl If"(E) x f£'(t)|| 
9. Continuing Exercise 8, the unit binormal vector B is defined by 
B(t)= T(E) x N(t). 
Show that 
f(t) x f(t) 
fC x f"(E)IN 


Note: The vectors T(t), N(t) and B(t) form a right-handed system of mutually perpendic- 
ular unit vectors (called orthonormal vectors) at each point on the curve f(t). 


B(t)= 


10. Continuing Exercise 9, the curvature « is defined by 


ITON _ fC) x (f"(E) x f(D 


0= el OSE 


Show that 
2 IIf'(E) x f(t)I| 


TIDE 


Note: x(t) gives a sense of how “curved” the curve f(t) is at each point. 


and that T(t) = ||f(E)|| x(N(). 


11. Find T, N, B and « at each point of the helix f(t) = (cost,sint,t). 


12. Show that the arc length L of a curve whose spherical coordinates are p = p(t), 0 = 0(t) 
and Q = ¢(t) for t in an interval [a, b] is 


b 
L= 1 0(0 + (p(t) sin p(t))0'(t)2 + p(t)2p'(t)® dt. 


2 Functions of Several Variables 


2.1 Functions of Two or Three Variables 


In Section 1.8 we discussed vector-valued functions of a single real variable. We will now 
examine real-valued functions of a point (or vector) in R or R3. For the most part these 
functions will be defined on sets of points in R, but there will be times when we will use 
points in R3, and there will also be times when it will be convenient to think of the points as 
vectors (or terminal points of vectors). 

A real-valued function f defined on a subset D of R is a rule that assigns to each point 
(x,y) in D a real number f(x,y). The largest possible set D in RÛ on which f is defined is 
called the domain of f, and the range of f is the set of all real numbers f(x,y) as (x,y) 
varies over the domain D. A similar definition holds for functions f(x, y,z) defined on points 
(x,y,z) in RÎ. 


Example 2.1. The domain of the function 


f(x, y)= xy 


is all of R, and the range of f is all of R. 


Example 2.2. The domain of the function 


1 
f)x,ل(‎ = -—— 


NY 


is all of R except the points (x, y) for which x = y. That is, the domain is the set D = {(, 7): 
x ¥ Jy}. The range of f is all real numbers except 0. 


Example 2.3. The domain of the function 


(= 1-4-۶ 


is the set D = {(x,y):x + y% < 1}, since the quantity inside the square root is nonnegative if 
and only if 1- (x + y2) > 0. We see that D consists of all points on and inside the unit circle 
in R® (D is sometimes called the closed unit disk). The range of f is the interval [0, 1] in R. 
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Example 2.4. The domain of the function 


x+y-z 


f(t,y,2)=e 


is all of R3, and the range of f is all positive real numbers. 


A function f(x, y) defined in RÎ is often written as z = f(x,y), as was mentioned in Section 
1.1, so that the graph of f(x,y) is the set {(x,y,2):2z = f(x,y)} in R3. So we see that this 
graph is a surface in RF, since it satisfies an equation of the form F(x<,y,z) = 0 (namely, 
F(x,y,z2) = f(x,y)—-z). The traces of this surface in the planes z = c, where c varies over R, 
are called the level curves of the function. Equivalently, the level curves are the solution 
sets of the equations f(x, y) = c, for c in R. Level curves are often projected onto the xy-plane 
to give an idea of the various “elevation” levels of the surface (as is done in topography). 


Example 2.5. The graph of the function 
sin J22 + y2 
x2 + 2ر‎ 


is shown below. Note that the level curves (shown both on the surface and projected onto the 
xy-plane) are groups of concentric circles. 


f(x,y) = 


sin x2 +y2 


Figure 2.1.1 The function f(x,y) = 


You may be wondering what happens to the function in Example 2.5 at the point (x,y) = 
(0,0), since both the numerator and denominator are O0 at that point. The function is not 
defined at (0,0), but the Jimit of the function exists (and equals 1) as (x, y) approaches (0,0). 
We will now state explicitly what is meant by the limit of a function of two variables. 
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Definition 2.1. Let (a,b) be a point in R, and let f(x,y) be a real-valued function defined 
on some set containing (a, b) (but not necessarily defined at (a, b) itself). Then we say that 
the limit of f(x, y) equals L as (x, y) approaches (a, b), written as 


lim 1 = IL )2.1( 
(x,y)—(a,b 


if given any e > 0, there exists a 0 > 0 such that 
|f(x,y)-L| <e whenever 0< ۷= (+ )- (۶ <0. 


A similar definition can be made for functions of three variables. The idea behind the 
above definition is that the values of f(x,y) can get arbitrarily close to L (i.e. within € of 
L) if we pick (x,y) sufficiently close to (a, b) (i.e. inside a circle centered at (a, b) with some 
sufficiently small radius 60). 

If you recall the “epsilon-delta” proofs of limits of real-valued functions of a single variable, 
you may remember how awkward they can be, and how they can usually only be done easily 
for simple functions. In general, the multivariable cases are at least equally awkward to go 
through, so we will not bother with such proofs. Instead, we will simply state that when the 
function f(x, y) is given by a single formula and is defined at the point (a, b) (e.g. is not some 
indeterminate form like 0/0) then you can just substitute (x, y) = (a, b) into the formula for 
f(x,y) to find the limit. 


Example 2.6. 
xy (1)2) 2 
lim = = 
)رر(‎ 1,2( 2+ y2 12+22 5 
since f(%,Y)= 73 a8 is properly defined at the point (1, 2). 


The major difference between limits in one variable and limits in two or more variables 
has to do with how a point is approached. In the single-variable case, the statement “x — a” 
means that x gets closer to the value a from two possible directions along the real number 
line (see Figure 2.1.2(a)). In two dimensions, however, (x, y) can approach a point (a, b) along 
an infinite number of paths (see Figure 2.1.2(b)). 


47 
ا‎ 
> 4 ll . 
1 1 = 
0 a x 0 
(a xain R (b) (x,y) ¬ (a,b) in R 


Figure 2.1.2 “Approaching” a point in different dimensions 
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Example 2.17. 
lim ي‎ does not exist 
()x,yر(¬)0,0(‎ x^ + y 
Note that we can not simply substitute (x, y) = (0,0) into the function, since doing so gives an 
indeterminate form 0/0. To show that the limit does not exist, we will show that the function 
approaches different values as (x, y) approaches (0,0) along different paths in R2. To see this, 
suppose that (x, y) — (0,0) along the positive x-axis, so that y = 0 along that path. Then 


x0 


(= x2 ا‎ 2+02 


along that path (since x > 0 in the denominator). But if (x, y) ¬ (0,0) along the straight line 


y = x through the origin, for x > 0, then we see that 
2 
XY x 1 
3 > ج‎ > 
e 2+2 »۽‎ +7 2 


which means that f(x,y) approaches different values as (x, y) — (0,0) along different paths. 
Hence the limit does not exist. 


Limits of real-valued multivariable functions obey the same algebraic rules as in the 
single-variable case, as shown in the following theorem, which we state without proof. 
Theorem 2.1. Suppose that lim f(x,y) and lim g(x,y) both exist, and that k is 

(x,y(¬(a,b( (x,y(¬ (a,b) 


some scalar. Then: 


li + g),y([ = li ,7( | ± li 
(1 7 8 ۳ک ر لري ا 1 ا‎ J1 


O n NE ll f(y) 1 


(x,y(¬)a,b( e b) 


)( سنا ي‎ E iro fe,» || iran (9,)ع‎ | 


(x,y)—(a, (x,y(¬)a,b( (x,y(¬)a,b( 
(),7( 
fey) _ 2 (a, 4 . 
(dQ) lim if lim g(x,y)Z#0 
(ey) a,b) g),9() اim‎ عg)e,y(‎ ey b( 
(x,y(¬)a,b( 


(e) If|f(x,y)-Ll| = g(x,y) for all (x,y) and if lim  g(x,y)=0, then lim f(x,y)=L. 
(x,y(¬ (a,b) (x,y)¬= (a,b) 


4 


Note that in part (e), it suffices to have |f(x,y)—L| = g(x, y) for all (x, y) “sufficiently close” 
to (a, b) (but excluding (a, b) itself). 
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Example 2.8. Show that 
4 
lim =0. 
)رر(‎ )0,0( £2 + y 

Since substituting (x, y) = (0,0) into the function gives the indeterminate form 0/0, we need 
an alternate method for evaluating this limit. We will use Theorem 2.1(e). First, notice that 
y* = )/y(“ and so 0 > *ر‎ > )/ e + y2 (ˆ for al (e, .)ر‎ But (+ + y2)“ = (2 + y2). Thus, for 
all (x, y) Z (0,0) we have 


ر٣‎ 


x2 + y2 


(2 4 y2 


Ty 2 FY =0 as (e,9) (0,0). 


4 


Therefore lim ERE 0. 
(ey) )0,0( £2 + y 


Continuity can be defined similarly as in the single-variable case. 


Definition 2.2. A real-valued function f(x,y) with domain D in RÎ is continuous at the 
point (a,b) in D if e f(x,y) = f(a,b). We say that f(x, y) is a continuous function if 
5,y)¬(a, 


it is continuous at every point in its domain D. 


Unless indicated otherwise, you can assume that all the functions we deal with are con- 
tinuous. In fact, we can modify the function from Example 2.8 so that it is continuous on all 
of R. 


Example 2.9. Define a function f(x,y) on all of R as follows: 


0 if (%,y) = (0,0) 


f(x,y)= 4 
E 3 i(9) (0,0) 


Xx 


Then f(x,y) is well-defined for all (x, y) in RÎ (i.e. there are no indeterminate forms for any 
(x, 7)), and we see that 


b+ 
f(x,y)= a f(a,b) for (a,b) ¥ (0,0). 


lim 
(x,y(¬(a,b( 
So since 
lim f(x,y)=0= f(0,0) by Example 2.8, 
)x,y(¬+)0,0( 


then f(x, y) is continuous on all of RÛ. 
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Exercises 
A 
For Exercises 1-6, state the domain and range of the given function. 
2 د‎ 
1. f(x,y)=x*+ y-1 2. i ER 
x2 +1 
3. f(x,y)= x2 + y-4 4. f(x,y)= 


5. f(x,y,2) = sSin(xy2z) 6. f(x,y,2)= V(x - 1)(y2 - 1) 


For Exercises 7-18, evaluate the given limit. 


lim cos(xy) ع‎ 


: lim 
)x,ر(¬)0,0(‎ )x,ر(¬)0,0(‎ 
2_2 2 
x= x 
)x,ر(¬+)0,0(‎ x۶ + y (x,y(¬)0,0( x^ + y 
2 2 2 
x - 2xy + x 
TIL i 1 Ch 
(x,y(¬)1,-1( x-y (x,ر(¬)0,0(‎ x2 + y 
2_2 2 2 
x = x - 2y + 
18. lim 14. lim ZW 
(,y)—(1,1) = Y (0,0)س(ر,)‎ x-y 
y*sin(y) 1 2 2 1 
15. 16. lim (x +y“)cos| — 
)e,ر()0,0( ر + 2ے‎ (,y)—(0,0( xy 
x 
17. lim — 18. lim cos|— 
(,y)—(0,0( Y (,y)—(0,0) xy 
B 


19. Show that f(x,y) = E, for o > 0, is constant on the circle of radius r > 0 
centered at the origin. This function is called a Gaussian blur, and is used as a filter in 
image processing software to produce a “blurred” effect. 


20. uRoS that f(x,y) = f(y,x) for all (x,y) in R2. Show that f(x,y) = f(y,x) for all (x, y) in 

R^. 
21. Use the substitution r = Vx + y to show that 

N 
(+ )0,0( 2 + y2 

(Hint: You will need to use LHöpital’s Rule for single-variable limits.) 
C 
22. Prove Theorem 2.1(a) in the case of addition. (Hint: Use Definition 2.1.) 


23. Prove Theorem 2.1(b). 
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2.2 Partial Derivatives 


Now that we have an idea of what functions of several variables are, and what a limit of 
such a function is, we can start to develop an idea of a derivative of a function of two or more 
variables. We will start with the notion of a partial derivative. 


Definition 2.3. Let f(x, y) be a real-valued function with domain D in R۶, and let (a,b) be 
a point in D. Then the partial derivative of f at (a,b) with respect to x, denoted by 


0 
8 b), is defined as 
dx 


a DN 
h¬+0 h 


(2.2) چ 


0 
and the partial derivative of f at (a,b) with respect to y, denoted by e, b), is defined 


as 
of f(a,b+h)- f(a,b) 


.2 ا = کا 
)2.8( ا 


Note: The symbol O is pronounced “del”. 


Recall that the derivative of a function f(x) can be interpreted as the rate of change of 
that function in the (positive) x direction. From the definitions above, we can see that the 
partial derivative of a function f(x, y) with respect to x or y is the rate of change of f(x, y) in 
the (positive) x or y direction, respectively. What this means is that the partial derivative of 
a function f(x,y) with respect to x can be calculated by treating the y variable as a constant, 
and then simply differentiating f(x,y) as if it were a function of x alone, using the usual 
rules from single-variable calculus. Likewise, the partial derivative of f(x, y) with respect to 
y is obtained by treating the x variable as a constant and then differentiating f(x, y) as if it 
were a function of y alone. 


0 0 
Example 2.10. Find و‎ and )و‎ for the function f(x,y) = xy + y3. 
x 4 
Solution: Treating y as a constant and differentiating f(x, y) with respect to x gives 


0 
و‎ = 2y 
dx 


and treating x as a constant and differentiating f(x, y) with respect to y gives 


of 


),y( = ر3 + م‎ : 
dy 


IIt is not a Greek letter. The symbol was first used by the mathematicians A. Clairaut and L. Euler around 
1740, to distinguish it from the letter d used for the “usual” derivative. 
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0 0 
instead of ۳ and Fe 
7y dx dy 


sin(y) 
x2 +1 


72 


0 0 
We will often simply write ۳ and ۳ 
x 


f 


0 0 
Example 2.11. Find a2 and ۳ for the function f(x,y) = 
3 y 


Solution: Treating y as a constant and differentiating f(x, y) with respect to x gives 


Of _ (x + 1)(y cos(xy2)) - (2) sin(xy%) 


(2 + 2 


Ox 


and treating x as a constant and differentiating f(x, y) with respect to y gives 


and E are themselves functions of x and y, we can take their partial 
04 


Of _2xy cos) ( 


x2 +1 


dy 


8 0 
Since both 4 


Ox 


derivatives with respect to x and y. This yields the higher-order partial derivatives: 


١ 


of of 
Qxdy 


and for the 


0f _ û (ûf 
y2 ل‎ 

of 8 2 
Qxdy Ox \dy 
of _ 0 ف‎ 
y3 0y 2را‎ 

f _ û (ûf 
0x y2 ا‎ 
of 0 ( of ) 
0x2dy dx \0xûy 
0f ل0‎ of 
Qydxdy E 


of of FF 
0x” ûy’ 0x2’ ûy’ Qyûx 


of _ û (of 
0x2 9 
of _ û (ûf 
ا‎ 
of _ 0 (of 
ا‎ 
f _ û لثûf‎ 
ا ×0 ر0‎ 
Of ل0‎ of 
y2 0x - ll 
0f ل0‎ of 
TE 


Example 2.12. Find the partial derivatives 


function f(x,y) = م‎ + xy. 
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0 
= 2e 7 + 3y 
dy 
0f 2 x2 
y 2 
ق‎ + 3x77) 
رجہ‎ ۷ 
x e” * +6xy 
of 2 x 
= 2 + ر3‎ 
Qxdy 0 0 


= 20*3 + 23 ye3 + 3y 


Solution: Proceeding as before, we have 


ر + 2xye™‏ ج ۳ 
( ر + rye‏ ک 
2ye 3 + 42 y2‏ ج 


= 2e + 2x3 ye” > + ر3‎ 


Higher-order partial derivatives that are taken with respect to different variables, such 


o2f 


2 
a8 3y, and e: are called mixed partial derivatives. Notice in the above example that 


Of _ @f 


2 
yr = Dry: It turns that this will usually be the case. Specifically, whenever both He and 


of 


axgy are continuous at a point (a,b), then they are equal at that point. All the functions 
we will deal with will have continuous partial derivatives of all orders, so you can assume in 


the remainder of the text that 


of _ of 


——— for all (x, y) in the domain of f. 


Qyûx . Qxdy 


In other words, it doesn’t matter in which order you take partial derivatives. This applies 
even to mixed partial derivatives of order 3 or higher. 
The notation for partial derivatives varies. All of the following are equivalent: 


D,(x,y), Dı(x,y) 


Dy(x,y), Dı(x,y) 
Da,.(x,y), Dı(%,y) 
D,yy(x,y), Ds2(%,y) 
D.y(z,y), Dıو(%,y)‎ 


D,4(%,Y) 9 Dzı(x,y) 


8 
i fat), fits), 
dx 
8 
fylzy), felt), 
dy 
of. 
۵x2 ` faslt.Y) ful), 
of 
ay2 fy), fa), 
of 
yox fay, ¥) 9 f(x,y) 9 
of 
xûy ` fal), fa(%.Y), 


2See pp. 214-216 in TAYLOR and MANN for a proof. 
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Exercises 
A 
For Exercises 1-16, find and 1. 
1. f)e,y( = 2ء‎ + y2 2. f(x,y) = cos)x + y( 
3. f(e,y)= Va +Y+4 4. رم‎ = 

y+1 
5. f(x,y)=e*™ + xy 6. f(x,y) = x7 - y2 + 6xy + 4x - 8y +2 
1. f(x,y) =» 8. f(x, J) =x +2y 
9. f),ر(‎ = 2 + 2ر‎ 10. f(x,y) = sin) + y( 
11. fey) = VaR +y+4 18. رم‎ 
x+y 
13. f(x,y)=e + 14. f(x,y) =1n(xy) 
15. f(x,y) = sin(xy) 16. f(x,y) = tan(x + y) 
For Exercises 17-26, find 1 and 2 (use Exercises 1-8, 14, 15). 
17. f)»,ر(‎ = 2 + ر‎ 18. f(x,y) = cos(x + y) 
19. f(.)= Vz +y+4 20. (= 
y+1 

21. f(x, y)=e* + xy 22. f)»,y( =x - y2 + 6xy + 4x - 8y + 2 
23. f),yر(=‎ ^ 24. f(x,y)=x+2y 
25. f(x,y) = In(xy) 26. f(x,y) = sin(xy) 
B 
27. Show that the function f(x, y) = sin(x + y)+ cos(x — y) satisfies the wave equation 

of of _ 

0x2 ay 


The wave equation is an example of a partial differential equation. 


28. Let u and u be twice-differentiable functions of a single variable, and let c Z 0 be a con- 
stant. Show that f(x,y) = u(x +cy)+ u(x — cy) is a solution of the general one-dimensional 


wave equation 
of 1 of _ 


0x2 c2 ay2 


3Conversely, it turns out that any solution must be of this form. See Ch. 1 in WEINBERGER. 
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2.3 Tangent Plane to a Surface 

In the previous section we mentioned that the partial derivatives and £ can be thought 
of as the rate of change of a function z = f(x, y) in the positive x and y directions, respectively. 
Recall that the derivative ر‎ of a function y = f(x) has a geometric meaning, namely as the 
slope of the tangent line to the graph of f at the point (x, f(+)) in R2. There is a similar 
geometric meaning to the partial derivatives 9£ and 5 of a function Zz = f(x,y): given a 
point (a,b) in the domain D of f(x, y), the trace of the surface described by z = f(x,y) in the 
plane y = b is a curve in RF through the point (a, b, f(a, b)), and the slope of the tangent line 
L, to that curve at that point is f (a,b). Similarly, (a,b) is the slope of the tangent line 
L, to the trace of the surface z = f(x,y) in the plane x = a (see Figure 2.3.1). 


42 12 ۵ 
2= f(x,y) slope = 3£(a,b) 
CAS 3), (ط‎ ۰ 
L, 2 
Î û + | ۹E ¥ 
0 0 
u a,b) اہ (طر) ا‎ 
KX D XxX D 
(a) Tangent line Ly in the plane y = b (b) Tangent line Ly in the plane x = a 


Figure 2.3.1 Partial derivatives as slopes 


Since the derivative ر‎ of a function y = f(x) is used to find the tangent line to the graph 
of f (which is a curve in R2), you might expect that partial derivatives can be used to define 
a tangent plane to the graph of a surface z = f(x,y). This indeed turns out to be the case. 
First, we need a definition of a tangent plane. The intuitive idea is that a tangent plane “just 
touches” a surface at a point. The formal definition mimics the intuitive notion of a tangent 
line to a curve. 


Definition 2.4. Let z = f(x,y) be the equation of a surface S in RF, and let P = (a,b,c) be 
a point on S. Let T' be a plane which contains the point P, and let Q = (x,y,z) represent a 
generic point on the surface S. If the (acute) angle between the vector PQ and the plane 
T' approaches zero as the point Q approaches P along the surface S, then we call T' the 
tangent plane to S at P. 


Note that since two lines in RF determine a plane, then the two tangent lines to the surface 
z = f(x,y) in the x and y directions described in Figure 2.3.1 are contained in the tangent 
plane at that point, if the tangent plane exists at that point. The existence of those two 
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tangent lines does not by itself guarantee the existence of the tangent plane. It is possible 
that if we take the trace of the surface in the plane x - y = 0 (which makes a 45° angle with 
the positive x-axis), the resulting curve in that plane may have a tangent line which is not 
in the plane determined by the other two tangent lines, or it may not have a tangent line 
at all at that point. Luckily, it turns out® that if ٣ and exist in a region around a point 
(a,b) and are continuous at (a, b) then the tangent plane to the surface z = f(x, y) will exist 
at the point (a,b, f(a, b)). In this text, those conditions will always hold. 


Suppose that we want an equation of the tangent plane T' 
to the surface z = f(x,y) at a point (a,b, f(a,b)). Let L, and F7 f 
L, be the tangent lines to the traces of the surface in the 
planes y = b and x = a, respectively (as in Figure 2.3.2), and 
suppose that the conditions for T' to exist do hold. Then the 
equation for T' is 


A(x —-a)+ B(y -b)+ C(z ¬ f(a,b))=0 (2.4) 


XxX 


where n = (A,B,C) is a normal vector to the plane T'. Since Figure 2.3.2 Tangent plane 


T' contains the lines L, and L,, then all we need are vectors v, and v, that are parallel to L, 
and L,, respectively, and then let n = v, Xx v,. 


Since the slope of L, is of (a,b), then the vector v, = (1,0, °۴ )a,ط((‎ is 
e 0 v, = (1,0, £(a,b)) 
parallel to L, (since v, lies in the xz-plane and lies in a line with slope ااا ا‎ 


(a,b) _ of Fi mae E 
T7 = 3. (a,b). See Figure 2.3.3). Similarly, the vector 


1 of 
Vv, = (0,1, (a, b)) is parallel to L,. Hence, the vector 1 3:), 0( 
1 XxX 
1 j k 0 1 
n=v,xv,=) 1 0 SE(a,0) |=-%(a,b)i- (a, b)j+k Figure 2.3.3 
of 
0 1 2y), 0( 
is normal to the plane T'. Thus the equation of T is 
- F(a, b)(e- a) $E(a,b)(y- b)+ 2 f(a,b) =0 . 2.5) 


Multiplying both sides by —-1, we have the following result: 
The equation of the tangent plane to the surface z = f(x,y) at the point (a, b, f(a, b)) is 


$E(a,b)(e- a) + $E(a,b)(y- b)- 2+ f(a,b)= 0 (2.6) 


4See TAYLOR and MANN, § 6.4. 
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Example 2.13. Find the equation of the tangent plane to the surface z = x2 + y% at the point 
(1,2,5). 

Solution: For the function f(x,y) = x + y, we have ا‎ 2x and $ = 2y, so the equation of 
the tangent plane at the point (1,2,5) is 


2(1)(x-1)+2(2)(y-2)—-z2+5 =0 , or 
2x+4y-z2—-5=0. 


In a similar fashion, it can be shown that if a surface is defined implicitly by an equation 
of the form F(x, y,z) = 0, then the tangent plane to the surface at a point (a,b,c) is given by 
the equation 

$E(a,b,c)(e-a)+ F(a, b,c)(y- b)+ $E(a,b,c)(2-c)=0 . (2.7) 


Note that formula (2.6) is the special case of formula (2.7) where F(x,y,2)= f(x,y)—z. 


Example 2.14. Find the equation of the tangent plane to the surface x + y2 + z = 9 at the 
point (2,2,—1). 


Solution: For the function F(x,y,z) = x + y% +z - 9, we have E = = 2% = 2y, and 9E - 2z, 


,’ = 

so the equation of the tangent plane at (2, 2,—-1) is 
2(2)(x - 2) + 2(2)(y - 2) + 2-1) + 1( = 0 , or 
2x+2y-z2-9=0. 
Exercises 

A 
For Exercises 1-6, find the equation of the tangent plane to the surface z = f(x,y) at the 
point P. 
1. f(x,y)=x7+ y3, P = (1,1,2) 2. f(x,y)= xy, P = (1,-1,-1( 
3. f(,y)= xy, P = (-1,1,1( 4. f(x,y) = xe”, P = (1,0,1) 
5. f(x,y)=x%+2y, P = (2,1,4) 6. f(x,y)= x2 + y2, P = )3,4,5( 
For Exercises 7-10, find the equation of the tangent plane to the given surface at the point 
P. 
7. E Bt P= (1.2, 4 ( 8. x7 + 2ر‎ +z = 9, P = )0,0,3( 


9. ×2 + ج - 2ر‎ = 0, ۶ = )8,4,5( 10. x+y =4, P = )۷3,1,0( 
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2.4 Directional Derivatives and the Gradient 


For a function z = f(x,y), we learned that the partial derivatives and £ represent the 
(instantaneous) rate of change of f in the positive x and y directions, respectively. What 
about other directions? It turns out that we can find the rate of change in any direction 


using a more general type of derivative called a directional derivative. 


Definition 2.5. Let f(x, y) be a real-valued function with domain D in R, and let (a, b) be a 
point in D. Let v be a unit vector in R. Then the directional derivative of f at (a,b) in 
the direction of v, denoted by D,f (a,b), is defined as 

(GG, b( E hv) 5 f(a, b) 


)2.8( ا 


Notice in the definition that we seem to be treating the point (a,b) as a vector, since we 
are adding the vector Av to it. But this is just the usual idea of identifying vectors with their 
terminal points, which the reader should be used to by now. If we were to write the vector v 
as V = (u,,U»), then 
f(a + hu,,b + hu»)— f(a, b) 


7 (2.9) 


D,f (a,b) = lim 

h¬0 

From this we can immediately recognize that the partial derivatives 3£ and 9£ are special 

cases of the directional derivative with v = i = (1,0) and v = j = (0, 1), respectively. That is, 

D;f and 9£ = D;f. Since there are many vectors with the same direction, we use a unit‏ = ف 
vector in the definition, as that represents a e vector for a given direction.‏ 


If f(x,y) has continuous partial derivatives Ş, and ف‎ (which will always be the case in 


this text), then there is a simple formula for the directional derivative: 


Theorem 2.2. Let f(x,y) be a real-valued function with domain D in R such that the 
partial derivatives and 5 exist and are continuous in D. Let (a,b) be a point in D, and 


let v = (v,,vu,) be a unit vector in R2. Then 


D,fta,b) =u, F(a,b)+u, (a,b) (2.10( 


dy 


Proof: Note that if v = i = (1,0) then the above formula reduces to D,f (a,b) = (a,b), 
which we know is true since D;f = م‎ as we noted earlier. Similarly, for v = j = (0, 1) the 
formula reduces to D,f (a,b) = (a,b), which is true since D;f = 1. So since i = (1,0) and 
j = (0,1) are the only unit vectors in RÛ with a zero component, then we need only show the 


formula holds for unit vectors v = (v,,u,) with vu, ¥ 0 and uv, ¥ 0. So fix such a vector v and 
fix a number Ah #0. 
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Then 
f(a +hu,,b + huaء)—‎ f(a,b) = f(a + hu,,b + hu» )— f(a + hu,,b)+ f(a + hu,,b)— f(a,b) . (2.11) 


Since A ¥ 0 and uv, # 0, then hus ¥ 0 and thus any number c between b and b + hu, can be 
written as c = b+ahu, for some number 0 < «a < 1. So since the function f(a+hu,, y) is a real- 
valued function of y (since a + hu, is a fixed number), then the Mean Value Theorem from 
single-variable calculus can be applied to the function g(y) = f(a + hu,, y) on the interval 
[b,b + hua] (or [Db + hu,,b] if one of h or u, is negative) to find a number 0 < «a < 1 such that 


4 


0 ت‎ 
gla + hun, b + ah) = g'(b + alı,) = g(0) _ f(a + hu,b + hus) f(a +hu1,b) 


b+huz—b hus 


and so 
f(a + hu,,b + hu»ء)—-‎ f(a +hu,,b) = hus g(a + hun, + ahs): 


By a similar argument, there exists a number 0 < Û < 1 such that 
of 
f(a +hu,,b)- f(a,b) = hur, (@ + Ghu,,b) . 
4 
Thus, by equation (2.11), we have 


f(a +hu,,b + hu,)- f (a,b) hus (a +huvٍ,b + ahuva) + hv, $E(a + Ghu1,b) 
h h 


8 0 
e + hu,,b + ahua) + a + Ghu,,b) 
dy dx 


so by formula (2.9) we have 


D,f(a,b) = lim ٤ E 


0 0 
= lim 0 + hu,,b + ahusa) + e + Ghu,,b) 
h¬~0| Oy dx 


0 0 0 0 
= BE CEB) by the continuity of ف‎ and i SO 
dy dx Ox dy 


D,f (a,b) EE a 
dx dy 


after reversing the order of summation. QED 


Note that D,f (a,b) = v-° (a,b), (a, b)). The second vector has a special name: 
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Definition 2.6. For a real-valued function f(x, y), the gradient of f, denoted by Vf, is the 


vector of of 
۴=), ( )2.12( 
dx dy 
in R®. For a real-valued function f(x, y,z), the gradient is the vector 
Of Of Of 
A n 2.13 
4 dy’ oz [ 


in R3. The symbol V is pronounced “del”.5 


Example 2.15. Find the directional derivative of f(x,y) = xy + x3y at the point (1,2) in the 
2 (1 1 
direction of v = (5 3 


Solution: We see that Vf = (y% + 3xy, 2xy + x3), so 


D,f(1,2) = v-Vf(1,2) = (3, 3) (2 + 8(1 (2), 212) + 1) ع‎ 


A real-valued function z = f(x,y) whose partial derivatives and 9۳ exist and are con- 
tinuous is called continuously differentiable. Assume that f(x, y) is such a function and that 
Vf #0. Let c be a real number in the range of f and let v be a unit vector in R which is 
tangent to the level curve f(x,y) = c (see Figure 2.4.1). 


7 vy Vf 


f(x,y)=c 


۰ 
ر 


ِ 


Figure 2.4.1 


5Sometimes the notation grad(/) is used instead of VF. 
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The value of f(x,y) is constant along a level curve, so since v is a tangent vector to this 
curve, then the rate of change of f in the direction of v is 0, i.e. D,f = 0. But we know that 
D,f =v°۰Vf = llvll II VF || cos0, where 0 is the angle between v and Vf. So since |l vl = 1 then 
D,f = ll Vf ll cos0. So since Vf # 0 then D,f = 0> cos0 = 0>0 =90°. In other words, Vf Lv, 
which means that Vf is normal to the level curve. 

In general, for any unit vector v in R2, we still have D,f = || Vf || cos@0, where 0 is the angle 
between v and Vf. At a fixed point (x, y) the length || Vf || is fixed, and the value of D,f then 
varies as Û varies. The largest value that D,f can take is when cosÛ0 = 1 (0 = 0°), while the 
smallest value occurs when cosO0 = —1 (0 = 180°). In other words, the value of the function 
f increases the fastest in the direction of Vf (since O0 = 0° in that case), and the value of 
f decreases the fastest in the direction of -Vf (since O0 = 180° in that case). We have thus 
proved the following theorem: 


Theorem 2.4. Let f(x,y) be a continuously differentiable real-valued function, with Vf Z 0. 
Then: 


(a) The gradient Vf is normal to any level curve f(x,y)=c. 
(b) The value of f(x, y) increases the fastest in the direction of Vf. 


(c) The value of f(x, y) decreases the fastest in the direction of Vf. 


Example 2.16. In which direction does the function f(x,y) = xy + x3y increase the fastest 
from the point (1,2)? In which direction does it decrease the fastest? 

Solution: Since Vf = (y2 + 3x2y,2xy + x3), then VF(,2) = (10,5) ¥ 0. A unit vector in that 
2 ا‎ 2 1 
V5? ¥5 V5 V5 1 
decreases the fastest in the direction of ( 7 =. 


direction is V = Th > ( ) Thus, f increases the fastest in the direction of ( and 


Though we proved Theorem 2.4 for functions of two variables, a similar argument can 
be used to show that it also applies to functions of three or more variables. Likewise, the 
directional derivative in the three-dimensional case can also be defined by the formula D,f = 
v-Vf. 


Example 2.17. The temperature T of a solid is given by the function T(x,y,z) = e * + e729 + 
e**, where x, y, z are space coordinates relative to the center of the solid. In which direction 
from the point (1, 1, 1) will the temperature decrease the fastest? 


Solution: Since Vf = (~e *,-2e72Y ,4e**), then the temperature will decrease the fastest in 
the direction of -Vf(1,1,1) = (e 1,2 2,-4e*). 
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Exercises 
A 
For Exercises 1-10, compute the gradient Vf. 
1 
2 > 
1. f(x,y)= x“ + y-1 2. E 
3. f(x,y) = V2 + y2 +4 4. f(x,y) = xe» 
5. f(x,y) =In(xy) 6. f(x,y) = 2x +5y 
1. f(x,y,z) = sin(xyz) 8. f(%,y,2) = xe 
9. f), yر,2(‎ = ×2 + 2ر + 2× = (2,ر,»)f .10 ج + 2ر‎ + 
For Exercises 11-14, find the directional derivative of f at the point P in the direction of 
E EE 
(4: 8): 
1 

11. f(x,y)=x7 + y-1, P = (1,1) 12. ,یی =(ر,)/‎ ۶ =)1,1( 

x+y 
13. f)»,ر(‎ = 2 + y2 + 4, P = )1,1( 14. f(x,y) = xe”, P = (1,1) 


For Exercises 15-16, find the directional derivative of f at the point P in the direction of 
)ا 1 5( ي‎ 
V3? V3? V8] 
15. f(x,y,z) = sin(xyz), P = (1,1,1) 16. f(x,y,2) = xe”, P = (1,1,1) 
17. Repeat Example 2.16 at the point (2,3). 
18. Repeat Example 2.17 at the point (3,1,2). 
B 


For Exercises 19-26, let f(x,y) and g(x,y) be continuously differentiable real-valued func- 
tions, let c be a constant, and let v be a unit vector in R2. Show that: 


19. V(cf)=cVf 20. V(f+g)=Vf +Vg 
21. Vfg=fVg+gVf 22. V(f/g)= E if gy) #0 
23. Df =-D,f 24. D„(cf)=cD,f 

25. D,(f+g)=D,f + D,g 26. D,(fg)=f D,g + gD,f 


27. The function r(x, y) = Vx + y2 is the length of the position vector r = xi+ yj for each 
point (x, y) in R2. Show that Vr = =F when (x, y) # (0,0), and that V(r) = 2r. 
r 
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2.5 Maxima and Minima 


The gradient can be used to find extreme points of real-valued functions of several variables, 
that is, points where the function has a local maximum or local minimum. We will consider 
only functions of two variables; functions of three or more variables require methods using 
linear algebra. 


Definition 2.7. Let f(x,y) be a real-valued function, and let (a,b) be a point in the domain 
of f. We say that f has a local maximum at (a, b) if f(x, y) = f(a, b) for all (x, y) inside some 
disk of positive radius centered at (a, b), i.e. there is some sufficiently small r > 0 such that 
f(x,y) = f (a,b) for all (x, y) for which (x — a) + (y— b) < r. 

Likewise, we say that f has a local minimum at (a,b) if f(x,y) > f(a,b) for all (x,y) 
inside some disk of positive radius centered at (a, b). 

If f(x,y) = f(a,b) for all (x,y) in the domain of f, then f has a global maximum at 
(a,b). If f(x,y) > f(a,b) for all (x,y) in the domain of f, then f has a global minimum at 
(a,b). 


Suppose that (a,b) is a local maximum point for f(x,y), and that the first-order partial 
derivatives of f exist at (a,b). We know that f(a,b) is the largest value of f(x,y) as (x,y) 
goes in all directions from the point (a, b), in some sufficiently small disk centered at (a,b). 
In particular, f(a, b) is the largest value of f in the x direction (around the point (a, b)), that 
is, the single-variable function g(x) = f(x,b) has a local maximum at x = a. So we know that 
g(a) = 0. Since g'(x) = f (e,0), then 9£ (a,b) = 0. Similarly, f(a, b) is the largest value of f 
near (a,b) in the y direction and so (a, b) = 0. We thus have the following theorem: 


Theorem 2.5. Let f(x, y) be a real-valued function such that both E b) and $a, b) exist. 
Then a necessary condition for f(x,y) to have a local maximum or minimum at (a, b) is that 
Vf(a,b) =0. 


Note: Theorem 2.5 can be extended to apply to functions of three or more variables. 
A point (a, b) where Vf (a, b) = 0 is called a critical point for the function f(x, y). So given 
a function f(x, y), to find the critical points of f you have to solve the equations (e, 7y(=0 


and (o, y) = 0 simultaneously for (x,y). Similar to the single-variable case, the necessary 
condition that Vf(a, b) = 0 is not always sufficient to guarantee that a critical point is a local 
maximum or minimum. 


Example 2.18. The function f(x,y) = xy has a critical point at (0,0): £ =y=0> y =0, and 
3 = x% = 03> x = 0, so (0,0) is the only critical point. But clearly f does not have a local 
maximum or minimum at (0,0) since any disk around (0, 0) contains points (x, y) where the 


values of x and y have the same sign (so that f(x,y) = xy > 0 = f(0,0)) and different signs (so 
that f(x,y) = xy < 0 = f(0,0)). In fact, along the path y = x in R®, f(x,y) = x7, which has a 
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local minimum at (0,0), while along the path y = -x we have f(x,y) = x7, which has a local 
maximum at (0, 0). So (0,0) is an example of a saddle point, i.e. it is a local maximum in one 
direction and a local minimum in another direction. The graph of f(x, y) is shown in Figure 
2.5.1, which is a hyperbolic paraboloid. 


100 


Figure 2.5.1 f(x,y) = xy, saddle point at (0,0) 


The following theorem gives sufficient conditions for a critical point to be a local maximum 
or minimum of a smooth function (i.e. a function whose partial derivatives of all orders exist 
and are continuous), which we will not prove here. 


Theorem 2.6. Let f(x, y) be a smooth real-valued function, with a critical point at (a, b) (i.e. 
Vf(a,b) = 0). Define 

0f 
Qyûx 


y2 


pal 
0x2 


(a,b) > 5(a,b)- ( (a, 0)) 
Then 

(a) if D >0 and (a,b) > 0, then f has a local minimum at (a, b) 
(b) if D >0 and ab) < 0, then f has a local maximum at (a, b) 


(c) if D < 0, then f has neither a local minimum nor a local maximum at (a, b) 


(d) if D =0, then the test fails. 


See TAYLOR and MANN, §7.6. 
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If condition (c) holds, then (a, b) is a saddle point. Note that the assumption that f(x, y) is 
smooth means that 


of of 

1 3 ( TE 
of of 
0 0 


2 2 2 2 2 
since اک‎ 2 Also, if D > 0 then (a,b) (a,b) = D+ (a, ))ھ‎ > 0, and so 3£(a,b) 


and £ (a, 0) have the same sign. This means that in parts (a) and (b) of the theorem one 


of of 2 : 
can replace g(a, b) by y2 (@, b) if desired. 


Example 2.19. Find all local maxima and minima of f(x,y) = x2 + xy + y2 - 83x. 


Solution: First find the critical points, i.e. where Vf = 0. Since 
0 0 
و‎ and اوا‎ 
dx dy 


then the critical points (x, y) are the common solutions of the equations 


2 y= =0 
x+2y =0 


which has the unique solution (x, y) = (2,—-1). So (2,1) is the only critical point. 
To use Theorem 2.6, we need the second-order partial derivatives: 


of, Of _ of _ 
0x2 ’ ر‎ ” y0 
and so ل‎ 4 
of “f 0“f 2 
D = -4)2,-1(-4)2,-1(- 2,-1(| = )2()2(-1 = 3 < 0 
پوو - 2-0و 2-0 چ‎ (= 2( > 
of 


and 322, —1)= 2 > 0. Thus, (2,—-1) is a local minimum. 
Example 2.20. Find all local maxima and minima of f(x,y) = xy — x3 — y%. 


Solution: First find the critical points, i.e. where Vf = 0. Since 


0 0 
E and او‎ 
0x dy 
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then the critical points (x, y) are the common solutions of the equations 


y-3 =0 
x- 2y =0 


The first equation yields y = 3x, substituting that into the second equation yields x-6x% = 0, 
which has the solutions x = 0 and x = 3. Sox=035> y= 3(0) = 0 and x = رج ج‎ = 3 = 5. 
So the critical points are (x, y) = (0,0) and (x,y) = (3, 5). 

To use Theorem 2.6, we need the second-order partial derivatives: 


of of of 
= =2, اسسا‎ 
0x2 0y yx 
٤ 2F 2 2 
TO OL e 
D = 350.0575 0:0( ( 50,0) = (-6(0(-2)-1 = -1 < 0 


and thus (0,0) is a saddle point. Also, 


ِ a i س‎ 


2 1 
1 1([ 2 
D = 2 6 12/ 2ق‎ 6 2 E) = ( 6)) 2-1 =1 <> 0 


2ق 
x2‏ 


1 


and (3 35) = -1<0. Thus, (3, ( is a local maximum. 


Example 2.21. Find all local maxima and minima of f(x,y) = (x - 2)* + (¢ — 2y). 


Solution: First find the critical points, i.e. where Vf = 0. Since 


2 = 4) - 2(5 +2(x-2y) and 2 = -4)x-2y( 
Ox dy 


then the critical points (x, y) are the common solutions of the equations 


4-2) + 2) - 2y( =0 
-4x-2y( =0 
The second equation yields x = 2y, substituting that into the first equation yields 4(2y- 2)3 = 


0, which has the solution y = 1, and so x = 2(1) = 2. Thus, (2, 1) is the only critical point. 
To use Theorem 2.6, we need the second-order partial derivatives: 


of Of _ 
y3 ` @yûx 
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So 
of  .. f ( of 


2 
2 ES 
D = 22 2-5, 5-2.) = (28)-(-4 = 0 


and so the test fails. What can be done in this situation? Sometimes it is possible to examine 
the function to see directly the nature of a critical point. In our case, we see that f(x,y) > 0 
for all (x, y), since f(x, y) is the sum of fourth and second powers of numbers and hence must 
be nonnegative. But we also see that f(2, 1) = 0. Thus f(x,y) > 0 = f(2,1) for all (x,y), and 
hence (2, 1) is in fact a global minimum for f. 


Example 2.22. Find all local maxima and minima of f(x,y) = (x + y2) + 3 


Solution: First find the critical points, i.e. where Vf = 0. Since 


0 

y2)‏ +( - 2(1 = ف 
dx‏ 

0 

ر + 1-2 )ر2 - 1 
dy‏ 


then the critical points are (0,0) and all points (x, y) on the unit circle x + y% = 1. 
To use Theorem 2.6, we need the second-order partial derivatives: 


02 
y2 ((] 7‏ + ) - 2)1 - 2 -( ر + ) -2]1 = ا 
2ق 
7 )ر + ) - 2)1 - ر2 -( ر + ) - 21 = ر 
( + )ے2 2 of‏ 
-4xy]2- )“ + ye y‏ = 
Qy dx‏ 


At (0,0), we have D = 4 > 0 and 10,0) = 2 > 0, so (0,0) is a local minimum. However, for 


points (x, y) on the unit circle x% + y% = 1, we have 
D = (-4xe (4ye ^)-(-4xye ^) = 0 


and so the test fails. If we look at the graph of f(x,y), as shown in Figure 2.5.2, it looks like 
we might have a local maximum for (x, y) on the unit circle x + y% = 1. If we switch to using 
polar coordinates (r,0) instead of (x, y) in R7, where r = x7 + y2, then we see that we can write 
f(x,y) as a function g(r) of the variable r alone: g(r) = re”. Then g(r) = 2r(1— r2)e 7”, 
so it has a critical point at r = 1, and we can check that g"(1) = 4e7" < 0, so the Second 
Derivative Test from single-variable calculus says that r = 1 is a local maximum. But r = 1 
corresponds to the unit circle x% + y% = 1. Thus, the points (x, y) on the unit circle x + y% = 1 
are local maximum points for f. 
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0.4 
0.35 
| 0.3 
a‏ 0 | 5 
چ ی کک 0 1 N‏ الور ا ا 
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ب 0 


TI) ک‎ 
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Figure 2.5.2 f(x,y) = (x2 + y2 (+ 


Exercises 
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A 


For Exercises 1-10, find all local maxima and minima of the function f(x, y). 


2. f)x,y( = - 12+ y2 + 8y 
4. f)»,ر(‎ = ر + 842 + 3ء‎ - 37 
6. f), y( = 23 - ر + ر6‎ 

8. f(x,y) =x +2y 


10. f)»,ر(‎ = -4x + 4y - 2y + 16× - 12y 


f),y) 
f(x,y) = x3 - 3x + y3 - 3y 


ر + 8 - 3م = 


f )e,y( = 23 + 6y + 3y 


ر + ےل = (ر,f)e‏ 


f )e,y( = 4 - 4y + 2y + 10x - 6y 


1. 


11. For a rectangular solid of volume 1000 cubic meters, find the dimensions that will min- 


imize the surface area. (Hint: Use the volume condition to write the surface area as a 
function of just two variables.) 


12. Prove that if (a,b) is a local maximum or local minimum point for a smooth function 
f(x,y), then the tangent plane to the surface z = f(x, y) at the point (a, b, f(a, b)) is parallel 


to the xy-plane. (Hint: Use Theorem 2.5.) 


C 


13. Find three positive numbers x, y, z whose sum is 10 such that xyz is a maximum. 
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2.6 Unconstrained Optimization: Numerical Methods 


The types of problems that we solved in the previous section were examples of unconstrained 
optimization problems. That is, we tried to find local (and perhaps even global) maximum 
and minimum points of real-valued functions f(x,y), where the points (x, y) could be any 
points in the domain of f. The method we used required us to find the critical points of f, 
which meant having to solve the equation Vf = 0, which in general is a system of two equa- 
tions in two unknowns (x and y). While this was relatively simple for the examples we did, 
in general this will not be the case. If the equations involve polynomials in x and y of degree 
three or higher, or complicated expressions involving trigonometric, exponential, or loga- 
rithmic functions, then solving even one such equation, let alone two, could be impossible by 
elementary means. 
For example, if one of the equations that had to be solved was 


x3 +9%-2=0, 


you may have a hard time getting the exact solutions. Trial and error would not help much, 
especially since the only real solution® turns out to be V728+1- VV28- 1. In a situation 
such as this, the only choice may be to find a solution using some numerical method which 
gives a sequence of numbers which converge to the actual solution. For example, Newton’s 
method for solving equations f(x) = 0, which you probably learned in single-variable calcu- 
lus. In this section we will describe another method of Newton for finding critical points of 
real-valued functions of two variables. 
Let f(x,y) be a smooth real-valued function, and define 


0f 0f ( 0f 2 


( ر ا ا 


Newton’s algorithm: Pick an initial point (xo, yo). For n = 0,1,2,3,..., define: 


2 2 2ق 2ق 
E (tn n) ey nn) hens n) Fn)‏ 
E (ens) E (en: ¥.) F(9.) 9 (en) i‏ 
n+1 = Mi © = TN TTC. = .‏ ,س Jj1] 3 Xj‏ 
D(n: Yn) a D(%n: ¥)‏ 1 
Then the sequence of points (%,,y,)%2, converges to a critical point. If there are several‏ 


critical points, then you will have to try different initial points to find them. 


"This is also a problem for the equivalent method (the Second Derivative Test) in single-variable calculus, 
though one that is not usually emphasized. 

SThere are also two nonreal, complex number solutions. Cubic polynomial equations in one variable can be 
solved using Cardan’s formulas, which are not quite as simple as the familiar quadratic formula. See USPENSKY 
for more details. There are formulas for solving polynomial equations of degree 4, but it can be proved that there 
is no general formula for solving equations for polynomials of degree five or higher. 
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90 


Example 2.23. Find all local maxima and minima of f(x,y) = x3 - xy — x + x y3 — y* 


Solution: First calculate the necessary partial derivatives: 


0 
ر4 - ر3 + - = 
0y‏ 


2 


Qyûx 


ر8 +1- 


6xy - 12y 


of 


Ox 
of _ 
x2 


,ر +1-رy-‏ 8= 


2 


6x 2 > 


9 


Notice that solving Vf = 0 would involve solving two third-degree polynomial equations in x 


and y, which in this case can not be done easily. 


We need to pick an initial point (xo, yo) for our algorithm. Looking at the graph ofz = f(x,y) 
over a large region may help (see Figure 2.6.1 below), though it may be hard to tell where 


چ 
TZ‏ 
E‏ 


the critical points are. 


50000 3 
0| 2 0 N 

-50000 8 SSN 
-100000 | چ ر مر‎ SSE 1 
ا ب‎ SSN 0 0 0 ۷ 
-2500 -15 
-30000 
ر350000-‎ 


20 20 


Figure 2.6.1 f(x,y)=x3-xy-x+ xy - y* for -20 < x < 20 and -20 < y < 20 


Notice in the formulas (2.14) that we divide by D, so we should pick an initial point where 
D is not zero. And we can see that D(0, 0) = (0)(0)-(-1)% = -1 # 0, so take (0,0) as our initial 
point. Since it may take a large number of iterations of Newton’s algorithm to be sure that 
we are close enough to the actual critical point, and since the computations are quite tedious, 
we will let a computer do the computing. For this, we will write a simple program, using 
the Java programming language, which will take a given initial point as a parameter and 
then perform 100 iterations of Newton’s algorithm. In each iteration the new point will be 
printed, so that we can see if there is convergence. The full code is shown in Listing 2.1. 
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//Program to find the critical points of f(Xx,y)=X^3-xy-x+xy^3-yJ^*4 
public class newton { 
public static void main(String[] args) 4 
//Get the initial point (x,y) as command-line parameters 
double x = Double.parseDouble(args[01]); //Initial x value 
double y = Double.parseDouble(args[1]); //Initial y value 
System. oUt .prinNtELn( Initial point: (CC FX TF TYE DOD), 
//Go through 100 iterations of Newton’s algorithm 
for (int n=1; n<=100; n++) { 
double D = fxx(x,y)x*fyy(x,y) - Math.pow(fxy(x,y¥),2); 
double xn = x; double yn = y; //The current x and y values 
if (D == 0) { //We can not divide by 0 
System.out.println("Error: D = 0 at iteration n = " + n); 
System.exit(0); //End the program 
} else { //Calculate the new values for x and y 


x = xn - (fyy(xn,yn)xfx(xn,yn) - fxy(xn,yn)x*fy(xn,yn))/D; 
y = yn - (fxx(xn,yn)xfy(xn,yn) - fxy(xn,yn)xfx(xn,yn))/D; 
System OUE paintln( MD ST TNT CCCTKT ST TVYVTE DVD), 
} 
} 
1 


//Below are the parts specific to the function f 
//The first partial derivative of f wrt x: 3x^2-y-1l+y^3 
public static double fx(double x, double y) { 
return 3*Math.pow(x,2) - y - 1 + Math.pow(y,3); 
//The first partial derivative of f wrt y: -x+3xy^2-4y^3 
public static double fy(double x, double y) { 
return -x + 3xx+Math.pow(y,2) - 4xMath.pow(y,3); 
} 
//The second partial derivative of f wrt XxX: 6x 
public static double fxx(double x, double y) { 
return 6*Xx; 
} 
//The second partial derivative of f wrt y: 6xy-12y^2 
public static double fyy(double x, double y) {4 
return 6*x*y - 12+*Math.pow(y,2); 
} 
//The mixed second partial derivative of f wrt x and y: -1+3y^2 
public static double fxy(double x, double y) {4 
return -1 + 3*Math.pow(y,2); 
} 


Listing 2.1 Program listing for newton.java 
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To use this program, you should first save the code in Listing 2.1 in a plain text file called 
newton. java. You will need the Java Development Kit® to compile the code. In the directory 
where newton. java is saved, run this command at a command prompt to compile the code: 
javac newton. java 
Then run the program with the initial point (0, 0) with this command: 
java newton 0 0 

Below is the output of the program using (0,0) as the initial point, truncated to show the 
first 10 lines and the last 5 lines: 


java newton O0 0 
Initial point: (0.0,0.0) 


n = 1: (0.0,-1.0( 

n = 2: (1.0,-0.5( 

n = 3: (0.6065857885615251, -0.44194107452339687( 
n = 4: (0.484506572966545, -0. 405341511995805 ( 

n = 5: (0.47123972682634485 , -0. 3966334583092305 ( 
n = 6: (0.47113558510349535, -0.39636450001936047( 
n = 7: (0.4711356343449705 , -0. 3963643379632247 ( 

n = 8: (0.4711356343449874, -0.39636433796318005( 
n = 9: (0.4711356343449874, -0.39636433796318005( 
n = 10: (0.4711356343449874, -0.39636433/96318005( 


= 96: (0.4711356343449874, -0.39636433796318005( 
= 97: )(0.4711356343449874, -0.39636433796318005( 

(0.4711356343449874, -0.39636433796318005( 
= 99: )(0.4711356343449874, -0.39636433796318005( 
= 100: )(0.4711356343449874, -0.39636433/96318005( 


e e is e e E 
Il 
e 
o0 


As you can see, we appear to have converged fairly quickly (after only 8 iterations) to 
what appears to be an actual critical point (up to Java’s level of precision), namely the point 
(0.4711356343449874,-0.39636433796318005(. | is easy to confirm that Vf = 0 at this 
point, either by evaluating 9£ and at the point ourselves or by modifying our program to 
also print the values of the partial derivatives at the point. It turns out that both partial 
derivatives are indeed close enough to zero to be considered zero: 

0 

° (0.4111356343449874, -0.39636433796318005( = 4.85722573273506 x 10 
x 

0 

° (0.4111356343449874, -0.39636433796318005( = -8.326672684688674 x 10" 
84 


We also have D(0.4711356343449874, -0.39636433796318005( = -8.776075636032301 > 0, 
so by Theorem 2.6 we know that (0.4711356343449874, -0.39636433796318005) is a saddle 
point. 


Available for free at http://www .oracle.com/technetwork/j ava/ j avase/downloads/ 
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Since Vf consists of cubic polynomials, it seems likely that there may be three critical 
points. The computer program makes experimenting with other initial points easy, and 
trying different values does indeed lead to different sequences which converge: 


java newton -1 -1 
Initial point: (-1.0,-1.0) 


n = 1: 5,-0.5( 


(-0. 

n = 2: (-0.49295774647887325, -0.08450704225352113( 
n = 3: (-0.18556747/52461383, -1.2047647348546167( 
n = 4: (-0.4540060574531383, -0. 8643989895639324) 
n = 5: (-0.3672160534444, -0.5426077421319053( 

n = 6: (-0.4794622222856417,-0.24529117721011612( 
n = 7: (0.11570743992954591, -2.4319791238981274( 
n = 8: (-0.05837/851765533317, -1.6536079835854451( 
n = 9: (-0.129841298650007,-1.121516233310142( 

n = 10: (-1.004453014967208 , -0. 9206128022529645 ( 
n = 11: (-0.5161209914612475, -0.4176293491131443( 
n = 12: (-0.5788664043863884,0.2918236503332734( 
n = 13: (-0.6985177124230715,0.49848120123515316( 
n = 14: (-0.6733618916578702,0.4345777963475479( 
n = 15: (-0.6704392913413444,0.4252025996474051( 
n = 16: (-0.6703832679150286,0.4250147307973365( 
n = 17: (-0.6703832459238701,0.42501465652421205( 
n = 18: (-0.6703832459238667,0.4250146565242004( 
n = 19: (-0.6703832459238667,0.42501465652420045( 
n = 20: (-0.6703832459238667,0.42501465652420045 ( 
n = 98: (-0.6703832459238667,0.42501465652420045( 
n = 99: (-0.6703832459238667,0.42501465652420045( 


n = 100: (-0.670383245923866/,0.42501465652420045 ( 


Again, it is easy to confirm that both £ and 9£ vanish at the point 
(-0.6703832459238667, 0.42501465652420045(, which means it is a critical point. And 


D(-0.6703832459238667, 0.42501465652420045( = 15.3853578526055 > 0 


e 0.6703832459238667, 0.42501465652420045( = -4.0222994755432 > 0 
so we know that (-0.6703832459238667, 0.42501465652420045) is a local maximum. An 
idea of what the graph of f looks like near that point is shown in Figure 2.6.2, which does 
suggest a local maximum around that point. 
Finally, running the computer program with the initial point (~5, -5) yields the critical 
point (-7.540962756992551,-5.595509445899435), with D < 0 at that point, which makes 
it a saddle point. 
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Figure 2.6.2 f(x,y)=x3-xy-x+xy3- y® for -1<x <O and Osy =1 


We can summarize our findings for the function f(x,y) = x3 — xy — x + x y3 — y*: 


(0.4711356343449874, -0.39636433796318005( : saddle point 
(-0.6703832459238667, 0.42501465652420045( : ا!بca!‎ maximum 
(-7.540962756992551, -5.595509445899485(: saddle point 


The derivation of Newton’s algorithm, and the proof that it converges (given a “reason- 
able” choice for the initial point) requires techniques beyond the scope of this text. See 
RALSTON and RABINOWITZ for more detail and for discussion of other numerical methods. 
Our description of Newton’s algorithm is the special two-variable case of a more general 
algorithm that can be applied to functions of n > 2 variables. 

In the case of functions which have a global maximum or minimum, Newton’s algorithm 
can be used to find those points. In general, global maxima and minima tend to be more 
interesting than local versions, at least in practical applications. A maximization problem 
can always be turned into a minimization problem (why?), so a large number of methods 
have been developed to find the global minimum of functions of any number of variables. 
This field of study is called nonlinear programming. Many of these methods are based on the 
steepest descent technique, which is based on an idea that we discussed in Section 2.4. Recall 
that the negative gradient Vf gives the direction of the fastest rate of decrease of a function 
f. The crux of the steepest descent idea, then, is that starting from some initial point, you 
move a certain amount in the direction of -Vf at that point. Wherever that takes you 
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becomes your new point, and you then just keep repeating that procedure until eventually 
(hopefully) you reach the point where f has its smallest value. There is a “pure” steepest 
descent method, and a multitude of variations on it that improve the rate of convergence, 
ease of calculation, etc. In fact, Newton’s algorithm can be interpreted as a modified steepest 
descent method. For more discussion of this, and of nonlinear programming in general, see 
BAZARAA, SHERALI and SHETTY. 


Exercises 


C 


1. Recall Example 2.21 from the previous section, where we showed that the point (2, 1) was 
a global minimum for the function f(x,y) = (x 2)* + (x— 2y). Notice that our computer 
program can be modified fairly easily to use this function (just change the return values 
in the fx, fy, fxx, fyy and fxy function definitions to use the appropriate partial derivative). 
Either modify that program or write one of your own in a programming language of your 
choice to show that Newton’s algorithm does lead to the point (2, 1). First use the initial 
point (0,3), then use the initial point (3, 2), and compare the results. Make sure that your 
program attempts to do 100 iterations of the algorithm. Did anything strange happen 
when your program ran? If so, how do you explain it? (Hint: Something strange should 
happen.) 


2. There is a version of Newton’s algorithm for solving a system of two equations 
fi(x,y)=0 and fa(x,y)=0, 


where f(x,y) and f(x,y) are smooth real-valued functions: 
Pick an initial point (xo, yo). For n = 0,1,2,3,..., define: 


fins Yn) f2(%ns Yn) f1(%ns Yn) fans Yn) 


of of: 5) 8 
dy nn) gy (nn) E en, Cn, n 
n+1 n D(n, Yn) 8 n+1 n D(en, yn) 8 
fı 0f: fı fz 


D ا ا ل ل ے‎ 
(Xn Yn) a (Xn Yn) 3y (Xn Yn) 7 (ns Yn) 2 (ns Yn) 


Then the sequence of points (%n,, J.2, converges to a solution. Write a computer program 
that uses this algorithm to find approximate solutions to the system of equations 


OO 
n 


f(x,y) = sin(xy)—-x—y =0 and f(x,y) = e*-2%+3y=0. 


Show that you get two different solutions when using (0,0) and (1, 1) for the initial point 
(%0. 0). 
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2.7 Constrained Optimization: Lagrange Multipliers 


In Sections 2.5 and 2.6 we were concerned with finding maxima and minima of functions 
without any constraints on the variables (other than being in the domain of the function). 
What would we do if there were constraints on the variables? The following example illus- 
trates a simple case of this type of problem. 


Example 2.24. For a rectangle whose perimeter is 20 m, find the dimensions that will max- 
imize the area. 


Solution: The area A of a rectangle with width x and height y is A = xy. The perimeter P of 
the rectangle is then given by the formula P = 2x+2y. Since we are given that the perimeter 
P = 20, this problem can be stated as: 


Maximize: f(x,y)= xy 
given: 2x + 2y = 20 


The reader is probably familiar with a simple method, using single-variable calculus, for 
solving this problem. Since we must have 2x + 2y = 20, then we can solve for, say, y in 
terms of x using that equation. This gives y = 10- x, which we then substitute into f to get 
f(x,y) = xy = x(10- x) = 10-2. This is now a function of x alone, so we now just have to 
maximize the function f(x) = 10x x% on the interval [0, 10]. Since f(t) = 10-2x=035> x =5 
and f"(5) = -2 < 0, then the Second Derivative Test tells us that x = 5 is a local maximum 
for f, and hence x = 5 must be the global maximum on the interval [0, 10| (since f = 0 at 
the endpoints of the interval). So since y = 10- x = 5, then the maximum area occurs for a 
rectangle whose width and height both are 5 m. 


Notice in the above example that the ease of the solution depended on being able to solve 
for one variable in terms of the other in the equation 2x + 2y = 20. But what if that were not 
possible (which is often the case)? In this section we will use a general method, called the 
Lagrange multiplier method"®, for solving constrained optimization problems: 


Maximize (or minimize): f(x,y) (or f(x<,y,Z)) 


given: g(x,y) =c (or g(x,y,2) = c) for some constant c 


The equation g(x, y) = c is called the constraint equation, and we say that x and y are con- 
strained by g(x,y) = c. Points (x,y) which are maxima or minima of f(x,y) with the con- 
dition that they satisfy the constraint equation g(x, y) = c are called constrained maximum 
or constrained minimum points, respectively. Similar definitions hold for functions of three 
variables. 

The Lagrange multiplier method for solving such problems can now be stated: 


1Û Named after the French mathematician Joseph Louis Lagrange (1736-1813). 
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Theorem 2.7. Let f(x,y) and g(x,y) be smooth functions, and suppose that c is a scalar 
constant such that Vg(x,y) ¥ 0 for all (x,y) that satisfy the equation g(x,y) = c. Then to 
solve the constrained optimization problem 


Maximize (or minimize): f(x,y) 


given: g(x,yJ)=c, 


find the points (x,y) that solve the equation Vf(x,y) = AVg(x,y) for some constant A (the 
number ۸ is called the Lagrange multiplier). If there is a constrained maximum or mini- 
mum, then it must be such a point. 


A rigorous proof of the above theorem requires use of the Implicit Function Theorem, 
which is beyond the scope of this text." Note that the theorem only gives a necessary con- 
dition for a point to be a constrained maximum or minimum. Whether a point (x, y) that 
satisfies Vf(x,y) = AVg(x, y) for some ۸ actually is a constrained maximum or minimum can 
sometimes be determined by the nature of the problem itself. For instance, in Example 2.24 
it was clear that there had to be a global maximum. 

So how can you tell when a point that satisfies the condition in Theorem 2.7 really is a 
constrained maximum or minimum? The answer is that it depends on the constraint func- 
tion g(x,y), together with any implicit constraints. It can be shown’ that if the constraint 
equation g(x, y) = c (plus any hidden constraints) describes a bounded set B in R/, then the 
constrained maximum or minimum of f(x,y) will occur either at a point (x, y) satisfying 
Vf(x,y)=AVg(x,y) or at a “boundary” point of the set B. 

In Example 2.24 the constraint equation 2x + 2y = 20 describes a line in RF, which by itself 
is not bounded. However, there are “hidden” constraints, due to the nature of the problem, 
namely 0 < x,y = 10, which cause that line to be restricted to a line segment in R® (including 
the endpoints of that line segment), which is bounded. 


Example 2.25. For arectangle whose perimeter is 20 m, use the Lagrange multiplier method 
to find the dimensions that will maximize the area. 


Solution: As we saw in Example 2.24, with x and y representing the width and height, 
respectively, of the rectangle, this problem can be stated as: 


Maximize: f(x,y)= xy 
given: g(x,y) = 2x + 2y = 0 


Then solving the equation Vf(x,y) = AVg(x, y) for some A۸ means solving the equations 


11See TAYLOR and MANN, $ 6.8 for more detail. 
12 Again, see TAYLOR and MANN. 
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د 
= 
o‏ 
1 


0 
۸ , namely: 
dy 


y= 2, 
xX = 21 


The general idea is to solve for A in both equations, then set those expressions equal (since 
they both equal A۸) to solve for x and y. Doing this we get 


so now substitute either of the expressions for x or y into the constraint equation to solve for 
x and y: 
20 = EGY SIFET = LFI =A, 53 gD 5 yb 


There must be a maximum area, since the minimum area is 0 and f(5,5) = 25 > 0, so the 
point (5, 5) that we found (called a constrained critical point) must be the constrained maxi- 
mum. 

.'. The maximum area occurs for a rectangle whose width and height both are 5 m. 


Example 2.26. Find the points on the circle x% + y = 80 which are closest to and farthest 
from the point (1,2). 


Solution: The distance d from any point (x, y) to the point (1, 2) is 


d= ¥ (x-1)2+ (y-2), 


and minimizing the distance is equivalent to minimizing the square of the distance. Thus 
the problem can be stated as: 


Maximize (and minimize): f(x,y) = (x — 1) + (y= 2) 


given: g(x,y) = 0 + ر‎ = 80 
Solving Vf(x,y) = AVg(x, y) means solving the following equations: 


2-1) = 2x, 
2(y-2( = 2y 


Note that x ¥ 0 since otherwise we would get —2 = 0 in the first equation. Similarly, y # 0. 
So we can solve both equations for A as follows: 


2 
۸= کج )٨2-/ں‏ = رر ج دد =۸ = 
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Substituting this into g(x,y) = x + y% = 80 yields 5x = 80, 42+ 2ر‎ = 80 k4 


so x = +4. So the two constrained critical points are (4,8) and 
(-4,—-8). Since f(4,8) = 45 and f(-4,-8) = 125, and since there 
must be points on the circle closest to and farthest from (1,2), 
then it must be the case that (4, 8) is the point on the circle clos- 
est to (1,2) and (-4,—8) is the farthest from (1,2) (see Figure 
2.7.1. 

Notice that since the constraint equation x+y = 80 describes 
a circle, which is a bounded set in R, then we were guaranteed 
that the constrained critical points we found were indeed the 
constrained maximum and minimum. 


4,8) 


#8 


(-4,-8( 


Figure 2.7.1 


The Lagrange multiplier method can be extended to functions of three variables. 


Example 2.27. 


Maximize (and minimize): f(x,y,2)=x+z 


1 گے + ر + 2 = (ے ,ر ,)ع given:‏ 


Solution: Solve the equation Vf (x, y,2) = AVg(x,y,2): 


1 = 2% 
0 = 2Ay 
1 = 2z 


The first equation implies A ¥ 0 (otherwise we would have 1 = 0), so we can divide by ۸ in the 
second equation to get y = 0 and we can divide by ۸ in the first and third equations to get 
ا‎ 2 = z. Substituting these expressions into the constraint equation g(x, y,z) = x + y2 + 


. 0L 1 
= 1 yields the constrained critical points (5 0 0, and (3 2° 0,2). Since f (5 ,0, (< 
f 1 0, (, and since the constraint equation x + y + z% = 1 describes a sphere (which is 


V2 
bounded) in RF, then (5 ° 0, بذ‎ is the constrained maximum point and (3 


constrained minimum point. 


2° 0, ( is the 


So far we have not attached any significance to the value of the Lagrange multiplier ۸. We 
needed ۸ only to find the constrained critical points, but made no use of its value. It turns 
out that A gives an approximation of the change in the value of the function f(x, y) that we 
wish to maximize or minimize, when the constant c in the constraint equation g(x,y) = c is 
changed by 1. 
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For example, in Example 2.25 we showed that the constrained optimization problem 
Maximize: f(x,y)= xy 
given: g(x,y) = 2x + 2y = 0 


had the solution (x, y) = (5,5), and that A = x/2 = y/2. Thus, A۸ = 2.5. In a similar fashion we 
could show that the constrained optimization problem 


Maximize: f(x,y)= xy 
given: g(x,y) = 2x + 2y = 21 


has the solution (x, y) = (5.25,5.25). So we see that the value of f(x,y) at the constrained 
maximum increased from f(5,5) = 25 to f(5.25,5.25) = 27.5625, i.e. it increased by 2.5625 
when we increased the value of c in the constraint equation g(x, y) = c from c = 20 to c= 21. 
Notice that A = 2.5 is close to 2.5625, that is, 


A zx Af = f(new max. pt) f(old max. pt) . 


Finally, note that solving the equation Vf (x,y) = AVg(x, y) means having to solve a system 
of two (possibly nonlinear) equations in three unknowns, which as we have seen before, 
may not be possible to do. And the 3-variable case can get even more complicated. All of 
this somewhat restricts the usefulness of Lagrange’s method to relatively simple functions. 
Luckily there are many numerical methods for solving constrained optimization problems, 
though we will not discuss them here. 


Exercises 


س 


„ Find the constrained maxima and minima of f(x,y) = 2x + y given that x + y = 4. 


2. Find the constrained maxima and minima of f(x,y) = xy given that x + 3y% = 6. 

3. Find the points on the circle x + y = 100 which are closest to and farthest from the point 
(2,3). 

B 


4. Find the constrained maxima and minima of f(x,y,z) = x + y2 + 2z given that 4x + 9y — 
36z = 36. 


5. Find the volume of the largest rectangular parallelepiped that can be inscribed in the 
ellipsoid 


+4 =1. 


x2 2ر‎ 22 
a b2 2 


Cc 


13See BAZARAA, SHERALI and SHETTY. 


3 Multiple Integrals 


3.1 Double Integrals 


In single-variable calculus, differentiation and integration are thought of as inverse opera- 
tions. For instance, to integrate a function f(x) it is necessary to find the antiderivative of f, 
that is, another function F(x) whose derivative is f(x). Is there a similar way of defining in- 
tegration of real-valued functions of two or more variables? The answer is yes, as we will see 
shortly. Recall also that the definite integral of a nonnegative function f(x) > 0 represented 
the area “under” the curve y = f(x). As we will now see, the double integral of a nonnegative 
real-valued function f(x,y) > 0 represents the volume “under” the surface z = f(x,y). 

Let f(x,y) be a continuous function such that f(x,y) > 0 for all (x,y) on the rectangle 
R= {(x,y):a <x < b,c < y < d} in R2. We will often write this as R = [a,b] x [c,d]. For any 
number x<* in the interval [a, b], slice the surface z = f(x, y) with the plane x = x* parallel to 
the yz-plane. Then the trace of the surface in that plane is the curve f(x*,y), where x* is 
fixed and only y varies. The area A under that curve (i.e. the area of the region between the 
curve and the xy-plane) as y varies over the interval [c,d] then depends only on the value of 
xx. So using the variable x instead of x*, let A(x) be that area (see Figure 3.1.1). 


2= f(y) 


Figure 3.1.1 The area A(x) varies with x 


Then A(x) = f 0 f(x,y)dy since we are treating x as fixed, and only y varies. This makes 
sense since for a fixed x the function f(x, y) is a continuous function of y over the interval 
[c,d], so we know that the area under the curve is the definite integral. The area A(x) is a 
function of x, so by the “slice” or cross-section method from single-variable calculus we know 
that the volume V of the solid under the surface z = f(x,y) but above the xy-plane over the 
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rectangle R is the integral over [a,b] of that cross-sectional area A(x): 


b b d 
4 =| A(x)dx ا‎ 1 f(x,y)dy 


We will always refer to this volume as “the volume under the surface”. The above expression 
uses what are called iterated integrals. First the function f(x, y) is integrated as a func- 
tion of y, treating the variable x as a constant (this is called integrating with respect to y). 
That is what occurs in the “inner” integral between the square brackets in equation (3.1). 
This is the first iterated integral. Once that integration is performed, the result is then an 
expression involving only x, which can then be integrated with respect to x. That is what 
occurs in the “outer” integral above (the second iterated integral). The final result is then 
a number (the volume). This process of going through two iterations of integrals is called 
double integration, and the last expression in equation (3.1) is called a double integral. 


dx (3.1) 


Notice that integrating f(x,y) with respect to y is the inverse operation of taking the 
partial derivative of f(x,y) with respect to y. Also, we could just as easily have taken the 
area of cross-sections under the surface which were parallel to the xz-plane, which would 
then depend only on the variable y, so that the volume V would be 


d 
2 


It turns out that in general” the order of the iterated integrals does not matter. Also, we will 
usually discard the brackets and simply write 


b 
1 f(u,y)dx| dy . (3.2) 


d rb 
2 1 1 FE EY: (3.8) 


where it is understood that the fact that dx is written before dy means that the function 
f(x,y) is first integrated with respect to x using the “inner” limits of integration a and b, 
and then the resulting function is integrated with respect to y using the “outer” limits of 
integration c and d. This order of integration can be changed if it is more convenient. 


Example 3.1. Find the volume V under the plane z = 8x + 6y over the rectangle R = [0,1] x 
[0,2]. 


1due to Fubini’s Theorem. See Ch. 18 in TAYLOR and MANN. 
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Solution: We see that f(x,y)= 8x +6y>Ofor Osx sland Os y =2, so: 


2 r1 
= (8x + 6y)dxdy 
0 0ل‎ 


2 
. م(‎ + 6xy 
0 


2 
(4+ 6y)dy 
0 


=1 dû 
ا‎ J 


2 
| 3+ ر4 
20 = 


Suppose we had switched the order of integration. We can verify that we still get the same 
answer: 


1 2م‎ 
¥= (8x + 6y)dydx 
0 J0 


1 =9 
= 1 (8x + 3| | a 
0 =0 


1 
= 1 (16x + 12)dx 
0 


1 
842 + 124 | 

0 
- 20 


Example 3.2. Find the volume V under the surface z = e**Y over the rectangle R = [2,3] x 
[1,2]. 


Solution: We know that f(x,y) = e> > 0 for all (x, y), so 


2 
2م 3م کے 


| 
® 
O1 
1 
M 
د‎ 
1 
e 
mM 
د‎ 
1 
mM 
دن‎ 
س‎ 
1 
mM 
eı 
1 
N 
MQ 
د‎ 
+ 
MQ 
دن‎ 


Recall that for a general function f(x), the integral f(x)dx represents the difference of 
the area below the curve y = f(x) but above the x-axis when f(x) > 0, and the area above the 
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curve but below the x-axis when f(x) = 0. Similarly, the double integral of any continuous 
function f(x, y) represents the difference of the volume below the surface z = f(x, y) but above 
the xy-plane when f(x, y) > 0, and the volume above the surface but below the xy-plane when 
f(x,y) = 0. Thus, our method of double integration by means of iterated integrals can be 
used to evaluate the double integral of any continuous function over a rectangle, regardless 
of whether f(x,y) > 0 or not. 


2r fr 
Example 3.3. Evaluate 1 sin(x + y)Jdxdy. 
0 0 


Solution: Note that f(x, y) = sin(x + y) is both positive and negative over the rectangle [0,7] x 
[0, 2r]. We can still evaluate the double integral: 


2r IT 2r aT 
1 sin(x + y)Jdxdy = 1 (cos + | ) dy 
0 0ل‎ 0 =0 


2r 
(-cos(y +7) +cosy)dy 


2 
-sin(y + 7)+ sin |j = —sin37 + sin27 —- (-sin7 + sin0) 
= 0 
Exercises 

A 
For Exercises 1-4, find the volume under the surface z = f(x,y) over the rectangle R. 
1. f(x,y)=4xy, R = [0,1] x [0,1] 2. f(x,y)=e* 7, R = [0,11 x [-1,1| 
3. f(x,y) = x3 + y7, R = [0,1] x [0,1] 4. f(x,y) =x*+xy+ y3, R = [1,2] x [0,2] 


For Exercises 5-12, evaluate the given double integral. 


1 2م 1 2م‎ 
5. 1 ا‎ (1-y) dxdy 6. 1 x(x + y)dxdy 
0 J1 0 J0 
2 1م‎ 2 ^1 
1. 1 1 (x+2)dxdy 8. | 1 x(xy + sinx)dxdy 
0 J0 1-ل1-‎ 
r/2 r1 r frr/2 
9. | | eon axay 1. | | sinxcos(y- r)dxdy 
0 0 0 J0 


2 2م 1 4م‎ 
11. 1 xydxdy 12. 1 1 1dxdy 
0 J1 1 


13. Let M be a constant. Show that J J Mdxdy = M(d -c)(b -a). 
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3.2 Double Integrals Over a General Region 


In the previous section we got an idea of what a double integral over a rectangle represents. 
We can now define the double integral of a real-valued function f(x,y) over more general 
regions in RP. 

Suppose that we have a region R in the xy-plane that is bounded on the left by the vertical 
line x = a, bounded on the right by the vertical line x = b (where a < b), bounded below by 
a curve y = g(x), and bounded above by a curve y = g(x), as in Figure 3.2.1(a). We will 
assume that g,(x) and g(x) do not intersect on the open interval (a, b) (they could intersect 
at the endpoints x = a and x = b, though). 


4Y 4y 
y= £2)%( 
d CN STEIN 

٣ ا‎ 
١ X= hد(y)‎ 
| y= 81)%( E EE 
1 1 R 
ا‎ £ 

0 ۳ b 0 
(a) Vertical slice: J f(x,y dydx (b) Horizontal slice: 2 4 ١ f(x,y dxdy 


Figure 3.2.1 Double integral over a nonrectangular region R 


Then using the slice method from the previous section, the double integral of a real-valued 


function f(x, y) over the region R, denoted by 1 / f(x,y)JdA, is given by 


R 
J fe.naa = 


£2)%( 
1 f(x,y)dy| dx (3.4) 
R 


81)%( 


This means that we take vertical slices in the region R between the curves y = g(x) and 
y = g(x). The symbol dA is sometimes called an area element or infinitesimal, with the A 
signifying area. Note that f(x, y) is first integrated with respect to y, with functions of x as 
the limits of integration. This makes sense since the result of the first iterated integral will 
have to be a function of x alone, which then allows us to take the second iterated integral 
with respect to x. 

Similarly, if we have a region R in the x<y-plane that is bounded on the left by a curve 
x = h,(y), bounded on the right by a curve x = hڪ(y),‎ bounded below by the horizontal line 
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y = c, and bounded above by the horizontal line y = d (where c < d), as in Figure 3.2.1(b) 
(assuming that h,(y) and h»(y) do not intersect on the open interval (c,@d)), then taking 


horizontal slices gives 
d 
Jj e.naa = | 
cC 


R 


hz(y) 
٣ f(x,y)dx Î dy (3.5) 


1)7( 


Notice that these definitions include the case when the region R is a rectangle. Also, if 
f(x,y) > 0 for all (x,y) in the region R, then Jf f(x,y)dA is the volume under the surface 
R 


z = f(x,y) over the region R. 
Example 3.4. Find the volume V under the plane z = 8x + 6y over the region R = {(x,y):0 = 
x<1, Osy < 2x2}. 


Solution: The region R is shown in Figure 3.2.2. Using vertical slices we 
get: 


V = |j se+enaa 
R 


1 [ 2م‎ 
چ‎ / (8x + 6y)dy 
0 |لo‎ 


1 22 
= 1 (8x +3] | a 
0 y=0 


1 
= (16x3 + 12x0) dx 
0 


0 1 


dx 


Figure 3.2.2 


1 
64 - 2 4+2 = | کو + رھ 
0 


We get the same answer using horizontal slices (see Figure 3.2.3): 
V = |j «se+enaa 
R 
1 
1 (8x + 6y)dx 
/2 


2 
E 


2 ST 
2 
| (4 +6 ر ا‎ ( dy 
2 2 
| «4+6y-@y+ yay = (4+ 4y-3۷2y)dy 


6V3 5/2 |2 _ 62733 _ 48 _ 32 _ 
0 | = 8+8 e = 16-£ = - 64 


dy 


Figure 3.2.3 


4y + 2y 
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Example 3.5. Find the volume V of the solid bounded by the three coordinate planes and 
the plane 2x + y+ 4z =4. 


(0,0,1) 2x+y+4z2 =4 


e 


(0,4,0) 


x (2,0,0) 


(a) (b) 


Figure 3.2.4 


Solution: The solid is shown in Figure 3.2.4(a) with a typical vertical slice. The volume V 
is given by Jf f(x,y)dA, where f(x,y) = z = 14 - 2x — y) and the region R, shown in Figure 
R 


3.2.4(b),is R ={(x,yJ):O0sx <2, Os y < -2x +4}. Using vertical slices in R gives 
V = J 1(4-2x-y)dA 
R 
27 م-2x+4‎ 
[f fa-2-nay 
0 0 
2 


y=-2x+4 
=| (4-2-7 | a 


2 
ج‎ 1 14 - 2%) dx 
0 


dx 


2 
1 3 _— 64 _ 4 
-3g)4-2x( 1 E 


For a general region R, which may not be one of the types of regions we have considered so 
far, the double integral Jf f(x, y)dA is defined as follows. Assume that f(x, y) is a nonnega- 
R 


tive real-valued function and that R is a bounded region in R, so it can be enclosed in some 
rectangle [a, b|x [c,d]. Then divide that rectangle into a grid of subrectangles. Only consider 
the subrectangles that are enclosed completely within the region R, as shown by the shaded 
subrectangles in Figure 3.2.5(a). In any such subrectangle [x;,;,ı] x [Y;,Y;,,J, pick a point 
(X:«, Yj»). Then the volume under the surface z = f(x,y) over that subrectangle is approxi- 
mately f(%;., Y;.) Ax; Ay;, where Ax; = X%;,ı ¬ %;, AY; = Yj, ¬ Jj, and f(%;., yj) is the height and 
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Ax; Ay; is the base area of a parallelepiped, as shown in Figure 3.2.5(b). Then the total vol- 
ume under the surface is approximately the sum of the volumes of all such parallelepipeds, 
namely 


22 (Siss Yj) Ax; AYj, (3.6) 
Jj i 


where the summation occurs over the indices of the subrectangles inside R. If we take 

smaller and smaller subrectangles, so that the length of the largest diagonal of the subrect- 

angles goes to 0, then the subrectangles begin to fill more and more of the region R, and so 

the above sum approaches the actual volume under the surface z = f(x, y) over the region R. 

We then define Jf f(x,y)dA as the limit of that double summation (the limit is taken over all 
R 


subdivisions of the rectangle [a,b] x [c,d] as the largest diagonal of the subrectangles goes 


to 0). 
Ay 
d 
12 
2= f(,۷) 
7 
Ax: ۰ E. 9 
Yt ۳ً 1 
ا‎ 
1 
(i 9 ¥)* ) 
ا‎ Yi J" JY 
0 ا‎ 
1 ا ر‎ 5 ٤ 
c N AO E إا‎ 
i 2 
0 a Xj; Xir1 b LL ix jx R 
(a) Subþbrectangles inside the region R (b) ْParallelepiped over a subrectangle, 


with volume f(%;x, Yj») Ax; AY; 


Figure 3.2.5 Double integral over a general region R 


A similar definition can be made for a function f(x, y) that is not necessarily always non- 
negative: just replace each mention of volume by the negative volume in the description 
above when f(x, y) < 0. In the case of a region of the type shown in Figure 3.2.1, using the def- 
inition of the Riemann integral from single-variable calculus, our definition of Jf f(t,y)dA 

R 


reduces to a sequence of two iterated integrals. 

Finally, the region R does not have to be bounded. We can evaluate improper double 
integrals (i.e. over an unbounded region, or over a region which contains points where the 
function f(x, y) is not defined) as a sequence of iterated improper single-variable integrals. 
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o r1/x2 
Example 3.6. Evaluate ا‎ 2ydydx. 
1 Jo 


Solution: 
oo r1/x oo و‎ y= 
1 1 2ydyds = | ا‎ | a 
1 0ل‎ 1 =0 
Ç9, oo 
= x *d× = ۾‎ = 0-)-3( > 1 
Exercises 
A 
For Exercises 1-6, evaluate the given double integral. 
1 ^1 IT PY 
| | 24x ydy dx 2. | sinxdxdy 
0 Jz 0 Jo 
2 rlnx 2 2Y 
8| | Axdydx 4| | e” dxdy 
1 0ل‎ 0 J0 
r/2 ry oo foo aS 
5. | cosx sinydxdy 6. | xye ® dxdy 
0 J0 


2 ry 1 م‎ 
1. 1 1 1dxdy 8. 1 1 2dydx 
0 J0 0 J0 


9. Find the volume V of the solid bounded by the three coordinate planes and the plane 
x+y+z=1. 


10. Find the volume V of the solid bounded by the three coordinate planes and the plane 
3x + 2y + 52 =6. 


11. Explain why the double integral /f 1dA gives the area of the region R. For simplicity, 


R 
you can assume that R is a region of the type shown in Figure 3.2.1(a). 


C 


12. Prove that the volume of a tetrahedron with mutually perpendic- 
ular adjacent sides of lengths a, b, and c, as in Figure 3.2.6, is e: 
(Hint: Mimic Example 3.5, and recall from 


Section 1.5 how three noncollinear points determine a plane.) 


13. Show how Exercise 12 can be used to solve Exercise 10. Figure 3.2.6 
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3.3 Triple Integrals 


Our definition of a double integral of a real-valued function f(x, y) over a region R in R can 
be extended to define a triple integral of a real-valued function f(r, y,z) over a solid S in Rî. 
We simply proceed as before: the solid S can be enclosed in some rectangular parallelepiped, 
which is then divided into subparallelepipeds. In each subparallelepiped inside S, with sides 
of lengths Ax, Ay and Az, pick a point (<,„,y„.,2,„). Then define the triple integral of f(x, y,z) 
over S, denoted by fff f(xt,y,z2)dV, by 

S 


i f(x,y,z2)dV = lim) J3 f(2 DAXAYAzZ , (3.7) 
S 


where the limit is over all divisions of the rectangular parallelepiped enclosing S into sub- 
parallelepipeds whose largest diagonal is going to 0, and the triple summation is over all the 
subparallelepipeds inside S. It can be shown that this limit does not depend on the choice 
of the rectangular parallelepiped enclosing S. The symbol dV is often called the volume 
element. 

Physically, what does the triple integral represent? We saw that a double integral could 
be thought of as the volume under a two-dimensional surface. It turns out that the triple 
integral simply generalizes this idea: it can be thought of as representing the hypervolume 
under a three-dimensional hypersurface w = f(x,y,z) whose graph lies in R^. In general, 
the word “volume” is often used as a general term to signify the same concept for any n- 
dimensional object (e.g. length in R', area in R). It may be hard to get a grasp on the concept 
of the “volume” of a four-dimensional object, but at least we now know how to calculate that 
volume! 

In the case where S is a rectangular parallelepiped [xı,x»1 x [yı,y»| x [zı,z»1, that is, S = 
{(C,Y 2:1 SX SX, Y1 5 Y = Ys, 21 = 2 =“ 29}, the triple integral is a sequence of three iterated 


integrals, namely 
22 P2 x2 
Jl s,2av ا 1 1 ج‎ f(xu,y,z2dxdydz , (3.8) 
5 21 JY1 1 


where the order of integration does not matter. This is the simplest case. 

A more complicated case is where S is a solid which is bounded below by a surface z = 
£1(x,y), bounded above by a surface z = g(x,y), y is bounded between two curves h,(x) and 
h(x), and x varies between a and b. Then 


b rhz(x) ^8g82(x,y) 

i f(x,Y,2)dV ا‎ f(xr,y,2dzdydx . (3.9) 
a dh 

S 


1)) عل‎ 1),y( 


Notice in this case that the first iterated integral will result in a function of x and y (since its 
limits ofintegration are functions of x and y), which then leaves you with a double integral of 
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a type that we learned how to evaluate in Section 3.2. There are, of course, many variations 
on this case (for example, changing the roles of the variables x, y, z), so as you can probably 
tell, triple integrals can be quite tricky. At this point, just learning how to evaluate a triple 
integral, regardless of what it represents, is the most important thing. We will see some 
other ways in which triple integrals are used later in the text. 


3 r2 r1 
Example 3.17. Evaluate | 1 1 (xy+z)dxdydz. 
0 0ل 0ل‎ 


3 r2 r1 3 2م‎ = 
[| | cor aaxasae = | f (e+ | aa: 
0 lo 0ل‎ 0 J0 =0 
3 2م‎ 
| 1 (y+ 2) dydz 
0 J0 
3 2 
1 „2 2 
. 2y + d 
f ا‎ 


3 
1 (1+ 22)dz 
0 


Solution: 


3 
z+ 2| = 12 
0 
1 »-1م‎ م2-»-رy‎ 
Example 3.8. Evaluate | 1 (x+y+z2)dzdydx. 
0 Jo 0 


Solution: 


1 »-1م 1 yر-»-2م -1م‎ 2=2-»-y 
1 1 (x+y+z)dzdydx = 1 (+ 2+ 12| | ayas 
0 J0 0 0 J0 2=0 


1-% 


(e+ y)(2-x-y)+ (2-x- 9)) dy ds 


(2-1 - xy - (ر3‎ dydx 


[€ 12 1.21 E 
= 9 ۴ XxX xX i 
ا‎ J-o JY<XY< OXY < GY ر‎ 
1 
سے‎ 11 1.8 
| (4 - 2x + k4) dx 
1 
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Note that the volume V of a solid in R is given by 


Jj av. (3.10(‏ ك 
S‏ 


Since the function being integrated is the constant 1, then the above triple integral reduces 
to a double integral of the types that we considered in the previous section if the solid is 
bounded above by some surface z = f(x,y) and bounded below by the xy-plane z = 0. There 
are many other possibilities. For example, the solid could be bounded below and above by 
surfaces Zz = g(x,y) and z = g(x, y), respectively, with y bounded between two curves h(x) 
and hz(x), and x varies between a and b. Then 


b rh2(x) ^g2(%,9) b rhz(x«) 
v= jj av = | ا‎ 1dady dx = | ا‎ (g£2(%,y)- £1, y)) dy dx 
a Jhı(x«) Jgı1(x,y) a Jhı(«) 


just like in equation (3.9). See Exercise 10 for an example. 


Exercises 


A 


For Exercises 1-8, evaluate the given triple integral. 


3 ^2 r1 1 fx ry 
1. 1 1 xyz dx dy dz 2. 1 1 xyzdzdy dx 
0 0ل 0ل 0 0ل 0ل‎ 
I fx Pxy 5 1 مz‎ py 2 
3. 1 x“ sinzdzdydx 4. 1 1 1 ze” dxdydz 
0 JO J0 0 JO JO 
e fy rlly جم رم 2 و‎ 
5. ا‎ 1 1 x “zdxdzdy 6. 1 1 1 yzdxdzdy 
1 0ل 0ل‎ 1 Jo 0ل‎ 
2 ^4 "3 1 م1-x‎ م1-x-y‎ 
7 ا‎ 1 1dxdydz 8. | 1 1dzdydx 
1 J2 J0 0 J0 0 


9. Let M be a constant. Show that i ا‎ J Mdxdy dz = M(z» — z21)(y2 — Y1)(%»2 = x1). 
B 


10. Find the volume V of the solid S bounded by the three coordinate planes, bounded above 
by the plane x + y +z =2, and bounded below by the plane z2 = x+y. 


C 


2z 


b y b 
11. Show that f(x(dxdydz ت‎ O Fads. (Hint: Think of how changing the 


a da da a 
order of integration in the triple integral changes the limits of integration.) 
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3.4 Numerical Approximation of Multiple Integrals 


As you have seen, calculating multiple integrals is tricky even for simple functions and 
regions. For complicated functions, it may not be possible to evaluate one of the iterated in- 
tegrals in a simple closed form. Luckily there are numerical methods for approximating the 
value of a multiple integral. The method we will discuss is called the Monte Carlo method. 
The idea behind it is based on the concept of the average value of a function, which you 
learned in single-variable calculus. Recall that for a continuous function f(x), the average 
value f of f over an interval [a,b] is defined as 


1 b 
f= | f(0dx . (3.11) 
b-—-a a 


The quantity b - a is the length of the interval [a,b], which can be thought of as the 
“volume” of the interval. Applying the same reasoning to functions of two or three variables, 
we define the average value of f(x, y) over a region R to be 


1 
= 1 GAA 8.19) 


where A(R) is the area of the region R, and we define the average value of f(x,y,z) over a 
solid S to be 


1 
= ,2)dV, 3.13 
f > | f(%,y,2) (3.13( 
S 
where V(S) is the volume of the solid S. Thus, for example, we have 


J fed4 = A(R) . 3.14 
R 


The average value of f(x,y) over R can be thought of as representing the sum of all the 
values of f divided by the number of points in R. Unfortunately there are an infinite number 
(in fact, uncountably many) points in any region, i.e. they can not be listed in a discrete 
sequence. But what if we took a very large number N of random points in the region R 
(which can be generated by a computer) and then took the average of the values of f for 
those points, and used that average as the value of F? This is exactly what the Monte Carlo 
method does. So in formula (3.14) the approximation we get is 


£2 _ (F92 
1 f(x, YJdA A(R)F + A(R)\/ 1 (3.15) 


R 


where 


N N 2 
3 D-1 iY: 0 f2 2;1) (%::¥:)) 


3.16 
N N , (3.16) 
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with the sums taken over the N random points (%1, Y1), ..., (Xn, Yn). The + “error term” in 
formula (3.15) does not really provide hard bounds on the approximation. It represents a 
single standard deviation from the expected value of the integral. That is, it provides a likely 
bound on the error. Due to its use of random points, the Monte Carlo method is an example 
of a probabilistic method (as opposed to deterministic methods such as Newton’s method, 
which use a specific formula for generating points). 

For example, we can use formula (3.15) to approximate the volume V under the plane 
z = 8x + 6y over the rectangle R = [0,1] x [0,2]. In Example 3.1 in Section 3.1, we showed 
that the actual volume is 20. Below is a code listing (montecarlo.java) for a Java program 
that calculates the volume, using a number of points N that is passed on the command line 
as a parameter. 


//Program to approximate the double integral of f(x,¥)=8x+6y 
//over the rectangle [0,1]x[0,2]. 
public class montecarlo 4 
public static void main(String[] args) { 
//Get the number N of random points as a command-line parameter 
int N = Integer.parseInt(args[0]); 
double x = 0; //x-coordinate of a random point 
double y = 0; //y-coordinate of a random point 
double f = 0.0; //Value of f at a random point 
double mf = 0.0; //Mean of the values of f 
double mf2 = 0.0; //Mean of the values of f^2 
for (int i=0;i<N;i++) { //Get the random coordinates 
x = Math.random(); //x is between 0 and 1 
y 2 x Math.random(); //y is between 0 and 2 
1 8*x + 6*y; //Value of the function 
mf = mf + f; //Add to the sum of the f values 
mf2 = mf2 + fxf; //Add to the sum of the f^2 values 


} 
mf = mf/N; //Compute the mean of the f values 
mf2 = mf2/N; //Compute the mean of the f^2 values 
System.out.println("N = " + N + ": integral = " + vol()*mf + " +/- " 
+ vol()*Math.sqrt((mf2 - Math.pow(mf,2))/N)); //Print the result 

} 

//The volume of the rectangle [0,1]x[0,2] 

public static double vol() { 
return 1*2; 


Listing 3.1 Program listing for montecarlo.java 


The results of running this program with various numbers of random points (e.g. java 
montecarlo 100) are shown below: 
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= 10: 19.36543087722646 +/- . 2. 7 

= 100: 21. 334419561385353 +/-| ¥ 9 

= 1000: 19.807662237526227 +/- 0.267017065 

= 10000: 20.080975812043256 +/- 0.083788162 06 

= 100000: 20.009403854556716 +/- N 7 
0 


= 1000000: 20.000866994982314 +/- 0.00832116874864286 


۹ ا ا ا ا 


As you can see, the approximation is fairly good. As N ¬ co, it can be shown that the 
Monte Carlo approximation converges to the actual volume (on the order of O(VN), in com- 
putational complexity terminology). 

In the above example the region RF was a rectangle. To use the Monte Carlo method for 
a nonrectangular (bounded) region R, only a slight modification is needed. Pick a rectangle 
FÊ that encloses R, and generate random points in that rectangle as before. Then use those 
points in the calculation of f only if they are inside R. There is no need to calculate the area 
of R for formula (3.15) in this case, since the exclusion of points not inside R allows you to 
use the area of the rectangle Ê instead, similar to before. 

For instance, in Example 3.4 we showed that the volume under the surface z = 8x + 6y 
over the nonrectangular region R = {(x,y):0 <= x = 1, 0< y < 2x%} is 6.4. Since the rectangle 
F = [0,1] x [0,2] contains R, we can use the same program as before, with the only change 
being a check to see if y < 2x for a random point (x,y) in [0,1] x [0,2]. Listing 3.2 below 
contains the code (montecarlo2.java): 


//Program to approximate the double integral of f(x,y)=8x+6y over the 
//region bounded by x=0, x=1, y=0, and y=2x^2 
public class montecarlo2 { 
public static void main(String[] args) {4 
//Get the number N of random points as a command-line parameter 
int N = Integer.parseInt(args[0]); 
double x = 0; //x-coordinate of a random point 
double y = 0; //y-coordinate of a random point 
double f = 0.0; //Value of f at a random point 
double mf = 0.0; //Mean of the values of f 
double mf2 = 0.0; //Mean of the values of f^2 
for (int i=0;i<N;i++) { //Get the random coordinates 
x = Math.random(); //x is between 0 and 1 
y = 2 * Math.random(); //y is between O and 2 
if (y < 2*Math.pow(x,2)) { //The point is in the region 
f = 8*%x + 6*y; //Value of the function 
mf = mf + f; //Add to the sum of the f values 
mia = mîi2 # f*1; //Add to the SUM Of the f2 values 
} 
} 
mf = mf/N; //Compute the mean of the f values 
mf2 = mf2/N; //Compute the mean of the f^2 values 
System.out.println("N = " + N + ": integral = " + vol()*mf + 
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" +/- " + vo0ol()x*Math.sqrt((mf2 - Math.pow(mf,2))/N)); 
} 
//The volume of the rectangle [0,1]x[0,2] 
public static double vol() { 
return 1*2; 
} 
} 


Listing 3.2 Program listing for montecarlo2.java 


The results of running the program with various numbers of random points (e.g. java 
montecarlo2 1000) are shown below: 


= 10: integral = 6.95747529014894 +/- 2.988302 

= 100: integral = 6.3149056229650355 +/- 0.9549009662109 
= 1000: integral = 6.477032813858756 +/- 0.31916837260973624 
= 10000: integral = 6.349975080015089 +/- 0.10040086346895105 
= 100000: integral = 6.440184132811864 +/- 0.03200476870881392 
= 1000000: integral = 6.417050897922222 +/- 0.01009454409789472 


0 


To use the Monte Carlo method to evaluate triple integrals, you will need to generate 
random triples (x,y,z) in a parallelepiped, instead of random pairs (x, y) in a rectangle, and 
use the volume of the parallelepiped instead of the area of a rectangle in formula (3.15) (see 
Exercise 2). For a more detailed discussion of numerical integration methods, see PRESS et 
al. 


Exercises 


1. Write a program that uses the Monte Carlo method to approximate the double integral 
JÎ e“ dA, where R = [0,1]x [0,1]. Show the program output for N = 10, 100, 1000, 10000, 


R 
100000 and 1000000 random points. 


2. Write a program that uses the Monte Carlo method to approximate the triple integral 
Jf e*Y* dV, where S = [0,1] x [0,1] x [0,1]. Show the program output for N = 10, 100, 


S 
1000, 10000, 100000 and 1000000 random points. 
3. Repeat Exercise 1 with the region R = {(x,y):-1sx s1, Osys«%. 
4. Repeat Exercise 2 with the solid S = {(x,y,2):O0sxs1, Osys1l, Oszs1l-x-—y}. 


5. Use the Monte Carlo method to approximate the volume of a sphere of radius 1. 


N 


2 
6. Use the Monte Carlo method to approximate the volume of the ellipsoid + 2 +4 =1. 
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3.5 Change of Variables in Multiple Integrals 


Given the difficulty of evaluating multiple integrals, the reader may be wondering if it is 
possible to simplify those integrals using a suitable substitution for the variables. The an- 
swer is yes, though it is a bit more complicated than the substitution method which you 
learned in single-variable calculus. 

Recall that if you are given, for example, the definite integral 


2 
ا‎ x Vx2-1dx, 
1 


then you would make the substitution 


م ج 1- ې =ںu‏ 


du = 2xdx 


=u+1 


which changes the limits of integration 


x=1 > u=0 


x=2 > u =3 


so that we get 
2 2 
| dve-1as = | 14.2۷42 -1dx 
1 1 
3 
| e+ Dvaau 
0 


3 
1 ا لا 2 ن(‎ du , which can be easily integrated to give 
0 


148 _ 
ا 


Let us take a different look at what happened when we did that substitution, which will give 
some motivation for how substitution works in multiple integrals. First, we let u = x -1. 
On the interval of integration [1,2], the function x > x - 1 is strictly increasing (and maps 
[1,2] onto [0,3]) and hence has an inverse function (defined on the interval [0,3]). That is, 
on [0,3] we can define x as a function of u, namely 


x = g(u) = Vu+1. 
Then substituting that expression for x into the function f(x) = x3 Vx - 1 gives 


f(e) = f(g(w) = (u+1P Vu, 
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and we see that 


d 
5 = ع')u(‎ < d>» 


g (u)du 


dx = u+1 du, 
so since 


g)0(=1 > 0= g )1( 
g)3(=2 > 3= ۽‎ 1)2( 


then performing the substitution as we did earlier gives 


2 2 
| roar = | x3 Vx2-1dx 
1 1 
8 
=| 1(u+ DVudu , which can be written as 
0 


3 
= (u+1™ Vu -3(u +1) du , which means 
0 


2 £ 1)2( 
1 f(e)dx = 1 f(g(u)g (u)du . 
1 £ 1)1) 


In general, if x = g(u) is a one-to-one, differentiable function from an interval [c,d] (which 
you can think of as being on the “u-axis”) onto an interval [a, b] (on the x-axis), which means 
that g'(u) ¥ 0 on the interval (c,d), so that a = g(c) and b = g(d), then c = g (a) and d = 
g (Db), and 


b gb 
1 f(x)dx = 1 
a E0) 


This is called the change of variable formula for integrals of single-variable functions, and it 
is what you were implicitly using when doing integration by substitution. This formula turns 
out to be a special case of a more general formula which can be used to evaluate multiple 
integrals. We will state the formulas for double and triple integrals involving real-valued 
functions of two and three variables, respectively. We will assume that all the functions 
involved are continuously differentiable and that the regions and solids involved all have 
“reasonable” boundaries. The proof of the following theorem is beyond the scope of the text. 


) 
f(g(u)g (u)du . (3.17( 


2See TAYLOR and MANN, § 15.32 and § 15.62 for all the details. 
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Theorem 3.1. Change of Variables Formula for Multiple Integrals 
Let x = x(u,u) and y = y(u,u) define a one-to-one mapping of a region R' in the uu-plane onto 
a region R in the xy-plane such that the determinant 


o ox 
_ |@u du 
J(u,u) = oy oy (3.18) 
Qu du 
is never 0 in R'. Then 
j fe.naac.s) چ‎ J feu suruNl Turd Au.) . (3.19) 
R R' 


We use the notation dA(x, y) and dA(u,uv) to denote the area element in the (x,y) and (u,u) 
coordinates, respectively. 

Similarly, if x = x(u,u,w), y = y(u,u,w) and z = z(u,u,w) define a one-to-one mapping of 
a solid S' in uvw-space onto a solid S in xyz-space such that the determinant 


Ox Ox Ox 
0u duv dw 
J(u,u,w) = (3.20) 
Qu du Ow 
0z dz dz 
0u duv dw 


is never 0 in S’, then 


Jlj fes.nDavea.s,2= f fetuo,w), surr), atu,0,DI(uv,w)| dV (uu, . (3.21) 
S S 


The determinant J(u,v) in formula (3.18) is called the Jacobian of x and y with respect 
to u and v, and is sometimes written as 


0), 7( 
J(u,u) = 1 3.22 
(u,u) (u0) (8.22) 
Similarly, the Jacobian J(u,u,w) of three variables is sometimes written as 
0 
a (3.23) 
O(u,u,w) 


Notice that formula (3.19) is saying that dA(x, y) = |J(u,u)| dA(u,u), which you can think of 
as a two-variable version of the relation dx = g'(u)du in the single-variable case. 
The following example shows how the change of variables formula is used. 
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Example 3.9. Evaluate J er dA, where R = {(x,y)J:x>0,yz>0,x+ys1I}. 
R 


Solution: First, note that evaluating this double integral without using substitution is prob- 
ably impossible, at least in a closed form. By looking at the numerator and denominator of 
the exponent of e, we will try the substitution u = x — y and u = x + y. To use the change of 
variables formula (3.19), we need to write both x and y in terms of u and vu. So solving for 
x and y gives x = 1u +u) and y = 1 — u). In Figure 3.5.1 below, we see how the mapping 
x = x(u,u) = (u +), y= y)u,ا0(‎ = 1(v — u) maps the region R' onto R in a one-to-one manner. 


04 4u 


Figure 3.5.1 The regions R and R'’ 


Now we see that 


dx Ox a 
du duv 3 3 1 1| 1 
J 9’ > کے ب کڪ چ‎ 5 SAS e, 
(u,u) o» oy | 1 | 3 |J(u,u)| 2 
Qu du 


so using horizontal slices in R’', we have 


[7 a کڪ‎ J eur surunlzcu,oa4a 
R R' 


0 
1 u |U=uU 
ت‎ U 
= (e E 
1 
چ‎ (e-e "(du 
1 
U 
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| 
ر‎ 
mM 
1] 
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م‎ 
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3.5 Change of Variables in Multiple Integrals 


The change of variables formula can be used to evaluate double integrals in polar coordi- 


x = x(r,O0) = rcos0 and y = y(r,0) = rsinO0, 


nates. Letting 


we have 
dx ûdx 
Ar AA cos0 —rsinO 
J(u,u) = or û0 > = rcos20+rsin0 = r > |J(u,u)| = |r| =r, 
dy dy sin rcosO0 
QOr 30 


(3.24) 


| fe.nasay کک‎ |j £<*cose,rsino)rar ao , 
R' 


so we have the following formula: 


Double Integral in Polar Coordinates 


R 


where the mapping x = rcosO, y = rsinO0 maps the region R' in the r0-plane onto the 


region R in the xy-plane in a one-to-one manner. 


Example 3.10. Find the volume V inside the paraboloid z = x + y for O< z =1. 


Figure 3.5.2 z= 4+ ر‎ 


Solution: Using vertical slices, we see that 


V = j| a-aaa = Jj a-«+ymaa, 
R R 


where R = {(x,y) : x2 + y7 < 1} is the unit disk in R (see 
Figure 3.5.2). In polar coordinates (r,0) we know that 
x+y =r and that the unit disk R is the set R' = {(r,0): 
Osrs1,0 <0 <2r}. Thus, 


2r r1 
V | | a-rmrarae 
0 0 


2r r1 
2 1 (r-r3)drd0 
0 0 


1 
NI3 س‎ 
¥ 
اھ‎ 
QA 
® 
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Example 3.11. Find the volume V inside the cone z = x2 + y% for O< 2z <1. 


Solution: Using vertical slices, we see that 
= j| a-aa4 ک‎ j (2+) dA, 
R R 


where R = {(x, y):x2 + y% < 1} is the unit disk in R 
(see Figure 3.5.3). In polar coordinates (r,0) we know 
that Vx + y2 =r and that the unit disk R is the set 
R' = {(r,0):0 <r <1,0 <0 < 2r}. Thus, 


xX 


Figure 3.5.3 z= x2 + y2 


1 
= 
N 
3 
o 
Qu 
© 


In a similar fashion, it can be shown (see Exercises 5-6) that triple integrals in cylindrical 
and spherical coordinates take the following forms: 


Triple Integral in Cylindrical Coordinates 


i f(x,y,2dxdy dz f(rcos0,rsinO0,z2)rdrdOdz , (3.25) 
S 9S 


where the mapping x = rcosO, y = rsinO, z = z maps the solid S' in r0z-space onto the 
solid S in xyz-space in a one-to-one manner. 


Triple Integral in Spherical Coordinates 


f (o sin cos0, p sin Q sin0, p cos p) p> sinpdpdpd0 , (3.26)‏ ا = f(x,y,2dxdydz‏ ا 
S S'‏ 


where the mapping x = p sin Q cosO, y = psinQ sinO, z = pcos maps the solid S' in pp0- 
space onto the solid S in xyz-space in a one-to-one manner. 
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Example 3.12. For a > 0, find the volume V inside the sphere S = x + y2 +z = aڑ.‎ 


Solution: We see that S is the set p = a in spherical coordinates, so 


V = ل‎ = [f [1o sneapaças 
ا‎ sinpdpd@g = 2 sinpdpd0 
ا‎ ۳ 


a 27 23 4ra3 
= کا‎ RE (0 = 30 = 3 


Exercises 


„. Find the volume V inside the paraboloid z = x + y for 0 < z <4. 


.„ Find the volume V inside the cone z = x2 + y% for 0< z =3. 


„. Find the volume V of the solid inside both x + y + 2% = 4 and x + y2 =1. 
„. Find the volume V inside both the sphere x + y + 2% = 1 and the cone z = Vx + y%. 


حم ۳ط N‏ ل من د o‏ 


. Prove formula (3.25). 6. Prove formula (3.26). 
7. Evaluate J) (ح) مذو‎ cos ) و‎ ( dA, where R is the triangle with vertices (0,0), (2,0) and 
R 
(1,1). (Hint: Use the change of variables u = (x + y)/2, u = (x — y7)/2.) 
8. Find the volume of the solid bounded by z = x + and z% = 4(2 + y). 


9. Find the volume inside the elliptic cylinder +7 =1forOsz s2. 
C 


10. Show that the volume inside the ellipsoid و ا‎ + =1is 4rabe . (Hint: Use the change 
of variables x = au, Yy = bu, z = cw, then O EE Bia rable S.12, ر(‎ 


11. Show that the Beta function, defined by 
1 
B(x,y) = 1 11-0 ldt, for x>0,y>0, 
0 
satisfies the relation B(y,x) = B(x, y) for x > 0, yJ >0. 


12. Using the substitution t = u/(u + 1), show that the Beta function can be written as 


oo u 1 
Be.) = | EET for x >0, y >0. 
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3.6 Application: Center of Mass 


Recall from single-variable calculus that for a region 7 

R={(,y):a <x < b,0 < y < f(x)} in R that represents y= f)»( 

a thin, flat plate (see Figure 3.6.1), where f(x) is a con- 7 
tinuous function on [a,b], the center of mass of R has 
coordinates (x, yJ) given by 


0 a b 
حدر لمو خدج‎ 
M M Figure 3.6.1 Center of mass of R 


where 


b 2 b 
M, = | a M, = | xf (Ddx, M= 


b 
1 fODdx , (3.27) 


بسچ 


assuming that R has uniform density, i.e the mass of R is uniformly distributed over the 
region. In this case the area M of the region is considered the mass of R (the density is 
constant, and taken as 1 for simplicity). 

In the general case where the density of a region (or lamina) R is a continuous function 
5 = 6(x,y) of the coordinates (x, y) of points inside R (where R can be any region in R2) the 
coordinates (x, J) of the center of mass of R are given by 


d 7= (3.28)‏ و 
ST and J= ٤‏ 


where 


M, = || sserna4, M, = || se, a4 M= [f se.naa, (3.29) 
R R R 


The quantities M, and M, are called the moments (or first moments) of the region R about 
the x-axis and y-axis, respectively. The quantity M is the mass of the region R. To see this, 
think of taking a small rectangle inside R with dimensions Ax and Ay close to 0. The mass 
of that rectangle is approximately 0(<x,yx)JAx Ay, for some point (xx, yx) in that rectangle. 
Then the mass of R is the limit of the sums of the masses of all such rectangles inside R as 
the diagonals of the rectangles approach 0, which is the double integral Jf 6(x,y)dA. 

R 


Note that the formulas in (3.27) represent a special case when 6(x, y) = 1 throughout R in 
the formulas in (3.29). 


Example 3.13. Find the center of mass of the region R = {(x,y):0 <x <1, 0 < y < 24%}, if the 
density function at (x,y) is 6(x,y)= x+y. 
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Solution: The region R is shown in Figure 3.6.2. We have 


M = j .»aa 
R 


1 2م‎ 
=| (x+y)Jdydx 0 
0 Jo 
2ر ا‎ Figure 3.6.2 
=| xy+ dx 
0 2 y=0 
ج‎ 1 (2x3 + 2x) dx 
0 
ک»‎ 25 9 
+ > 
2 5 0ا‎ 10 
and 
M, = | 3«.aa M, = | saa 
R R 
1 2م 1 22م‎ 
چ‎ y(x+ y)dydx a x(x + y)Jdydx 
0 J0 0 J0 
1 2 3 2=ر‎ 1 2 y= 
a ا ا | ا‎ | 
0 | 2 3 | =0 0 2 0 
1 84° 1 
ج‎ (2x + (dx ک‎ 1 (2x + 25)dx 
0 3 0 
گ»‎ 8 5 2 x» 1 
کک ا س لإ س = = لإ س‎ 
3 21l 7 5 15ا وم3‎ 
so the center of mass (xX, J) is given by 
My, 11/15 _ 22 _ Mr, 5/1 50 


“YY golO0 27 7M O 63 


Note how this center of mass is a little further towards the upper corner of the region R than 


45( 


when the density is uniform (it is easy to use the formulas in (3.27) to show that (#, ¥) = ( 


in that case). This makes sense since the density function 0(x, y) = x + y increases as (x, y) 


approaches that upper corner, where there is quite a bit of area. 


In the special case where the density function 6(x, y) is a constant function on the region 


R, the center of mass (x, y) is called the centroid of R. 
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The formulas for the center of mass of a region in R can be generalized to a solid S in RÎ. 
Let S be a solid with a continuous mass density function 6(x, y,z) at any point (x,y,z) in S. 
Then the center of mass of S has coordinates (%, J,7), where 

Myz Mxz May 


Ee FET ES , (3.30) 


where 


Myz = Jj e.s.2av , Mrz = Jjj 3e.s,2av , May = Jj] 8.s.2av , (3.31) 
S S S 
M = Jjj ses.mav . (3.32) 
S 


In this case, M,yz, M,z and M,, are called the moments (or first moments) of S around the 
yz-plane, xz-plane and xy-plane, respectively. Also, M is the mass of S. 


Example 3.14. Find the center of mass of the solid S = {(<,y,2):z >0, x2 + ر‎ +2 < a}, if 
the density function at (x,y,z) is 6(x,y,2) =1. 


Solution: The solid S is just the upper hemisphere inside the sphere 
of radius a centered at the origin (see Figure 3.6.3). So since the 
density function is a constant and S is symmetric about the z-axis, 
then it is clear that x = 0 and J = 0, so we need only find 2Z. We have 


M = Jj s«.s.2av = ||| 1av = Volumecs). 
S S 


But since the volume of S is half the volume of the sphere of radius 


a, which we know by Example 3.12 is e then M = And 


May = jjj 8..2av 
S 


5 ا‎ / / zdV , which in spherical coordinates is 
S 


xX 
Figure 3.6.3 


2r rr/2 ra 
1 | (p0 cos 4( م‎ sinpdpdpd0 
0 0 0 


= I sin p s0 f )مم‎ dpd0 


2r rr/2 4 
= 1 1 ® sinp cospdp d0 
0 0 
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2r prrl2 , 
May = > sin2pdpd0 (since sin 24 = 2sin p cos p) 
0 0 


2r = 7/2 
ت‎ 4 c0s20| ٤ | 
0 


=0 
2r 
a 
= > d0 
٤ 
_ ra® 
a 
SO 1 
2 May 5 1 3a 
M 2ra3 8 
Thus, the center of mass of S is (%,yJ,2)= (0,0, 2). 
Exercises 
A 
For Exercises 1-5, find the center of mass of the region R with the given density function 
0(x,y). 


1. R={(x,Y:O0sxs2, Osys4}, 6(x,yJ)= 2y 
2. R={(x,» :Osx<1, Oxy}, S(x,y )= x+y 
. R={(,y):y >0, x+y < a}, S(t, =1 
. R={(,y):y >0, x >0, ڈ1‎ 2+ y2 > 4}, 6), y( = V2 + y2 


3 
4 
5. R={(,y):y >0, x+y <1}, 0) ,yر(=y‎ 
B 


For Exercises 6-10, find the center of mass of the solid S with the given density function 
(t,2). 


6. S={(x,y,2):0sxs1l, Osys1l, Osz s1}, 0(x,y,2)= xyz 

۶ے + ر + ۶ = (2 ,ر ,)5 ,[ > ۶ے + ر + 2ى ,0 < 8S = ]),y,2(:2‏ .7 
a, 0(x,y,2)=1‏ > 2 + 2ر + 2× ,2>0 S={(x,y,2):x >0, yY>0,‏ .8 

9. S={(x,y,2:0sx<1, O<y<1, 0<2 <1}, S(x,y, 2) = 2 + y2 + 2 


10. S={(x,y,2):0sxs1l, Osys1l, Oszs1l—-x—y}, 0(x,y,2) =1 
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3.7 Application: Probability and Expected Value 


In this section we will briefly discuss some applications of multiple integrals in the field of 
probability theory. In particular we will see ways in which multiple integrals can be used to 
calculate probabilities and expected values. 


Probability 


Suppose that you have a standard six-sided (fair) die, and you let a variable X represent 
the value rolled. Then the probability of rolling a 3, written as P(X = 3), is 3, since there 
are six sides on the die and each one is equally likely to be rolled, and hence in particular 
the 3 has a one out of six chance of being rolled. Likewise the probability of rolling at most a 
3, written as P(X < 3), is > 3, since of the six numbers on the die, there are three equally 
likely numbers (1, 2, and 3) that are less than or equal to 3. Note that P(X <= 3) = P(X = 
1)+ P(X = 2)+ P(X = 3). We call X a discrete random variable on the sample space (or 
probability space) ©2 consisting of all possible outcomes. In our case, ©2 = {1,2,3,4,5,6}. An 
event A is a subset of the sample space. For example, in the case of the die, the event X <3 
is the set {1,2,3}. 


Now let X be a variable representing a random real number in the interval (0, 1). Note 
that the set of all real numbers between 0 and 1 is not a discrete (or countable) set of values, 
i.e. it can not be put into a one-to-one correspondence with the set of positive integers. In 
this case, for any real number x in (0, 1), it makes no sense to consider P(X = x) since it must 
be 0 (why?). Instead, we consider the probability P(X < x), which is given by P(X <x) =x. 
The reasoning is this: the interval (0, 1) has length 1, and for x in (0, 1) the interval (0,x) 
has length x. So since X represents a random number in (0, 1), and hence is uniformly 
distributed over (0, 1), then 


length of (0,x%) _ x 


AEE eB TI 


=X. 


We call X a continuous random variable on the sample space ©2 = (0,1). An event A is a 
subset of the sample space. For example, in our case the event X < x is the set (0,x). 


In the case of a discrete random variable, we saw how the probability of an event was the 
sum of the probabilities of the individual outcomes comprising that event (e.g. P(X < 3) = 
P(X =1)+ P(X = 2)+ P(X =3) in the die example). For a continuous random variable, the 
probability of an event will instead be the integral of a function, which we will now describe. 


Let X be a continuous real-valued random variable on a sample space ©2 in R. For sim- 


3For a proof see p. 9-10 in KAMKE, E., Theory of Sets, New York: Dover, 1950. 
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plicity, let ©2 = (a,b). Define the distribution function F of X as 


F(x) = P(X =x), for -o00<x%x <oo (3.33) 
1, for x > b 
= 4 P(X =x), fora<x<b (3.34) 
0, for x Sa. 


Suppose that there is a nonnegative, continuous real-valued function f on R such that 
XxX 
F(x) = 1 f(v dy, for -o0<x<oo, (3.35) 
—0oO 


and 


OO 
| fear =1. (3.36) 
—0oO 
Then we call f the probability density function (or p.d.f for short) for X. We thus have 
xX 
P(X <x) = ا‎ f(Ydy, fora<x<D. (3.37) 
a 
Also, by the Fundamental Theorem of Calculus, we have 
F'(x) = f(x), for =o0<x <oo. (3.38) 


Example 3.15. Let X represent a randomly selected real number in the interval (0, 1). We 
say that X has the uniform distribution on (0, 1), with distribution function 


ى 


, forx>=>1 
F(x) = P(X <x) = 1x, forO0<x<1 (3.39) 


, forxs0, 


Oo 


and probability density function 


1, forO0<x<1 
fe) = F'(%) = ا‎ (3.40) 
0, elsewhere. 
In general, if X represents a randomly selected real number in an interval (a, b), then X has 
the uniform distribution function 


1, for x > b 
F(x) = P(X <x) = 1 7&, fora<x<b (3.41( 
0, forxsSa, 


and probability density function 


deh. 
aR Os | a TEES (3.492) 


0, elsewhere. 


130 CHAPTER 3. MULTIPLE INTEGRALS 


Example 3.16. A famous distribution function is given by the standard normal distribution, 
whose probability density function f is 


1 
f(0) = e ™ 2 , for o0 <4 < oo. (3.43) 


Var 


This is often called a “bell curve”, and is used widely in statistics. Since we are claiming that 
f isa p.d.f., we should have 


2/- ا 
e dx =1 (3.44)‏ 
2r‏ ۷ 1 


by formula (3.36), which is equivalent to 
9 -/2 
1 e dx = V2. (3.45) 
—ooO 


We can use a double integral in polar coordinates to verify this integral. First, 


5 ۳ ى‎ +2 dxdy = 2 1 2م‎ d4) dy 
1 2 ds) 5 e2 d9 
= 5 2 ax) 


since the same function is being integrated twice in the middle equation, just with different 
variables. But using polar coordinates, we see that 


o0 foo - +2 2r roo 2/9 
1 1 e * Y “Ax dy = 1 e’ “rdrd0 
-00 00ل‎ 0 0 


2r 
Ev 
ا‎ (7 
0 


2r 2r 
= 1 (0-(-e»a0 = | 1d0 = 2r, 
0 0 


and so 


2r , and hence 


oo 2 
0 2م‎ ds) 


° 2/2 
ا‎ e * “dx = VII. 
ê, 8 
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In addition to individual random variables, we can consider jointly distributed random 
variables. For this, we will let X, Y and Z be three real-valued continuous random variables 
defined on the same sample space ©2 in R (the discussion for two random variables is similar). 
Then the joint distribution function F of X, Y and Z is given by 

F(x,y,2) = P(Xs<x, Y sy, Zz), fOr =00<%,Y,2 <0. (3.46) 


If there is a nonnegative, continuous real-valued function f on RF such that 


2 y 5 
Fey, = | 1 1 f(u,u,w)dududw , for 00< x,y,z <o0O (3.47) 


and dS 
1 1 ا‎ f(x,y,z2dxdydz =1, (3.48) 


then we call f the joint probability density function (or joint p.d.f for short) for X, Y and Z. 
In general, for a, < Dı, d2 < Db», a < b4, we have 


b3 ba b1 
P(a<X <b, a,<Y <b, ag <Z < b4) = 1 1 f(xu,y,z2dxdydz , (3.49) 
a3 a2 a1 


with the < and < symbols interchangeable in any combination. A triple integral, then, can 
be thought of as representing a probability (for a function f which is a p.d.f.). 


Example 3.17. Let a, b, and c be real numbers selected randomly from the interval (0, 1). 
What is the probability that the equation ax + bx + c = 0 has at least one real solution x? 


Solution: We know by the quadratic formula that there is at least 1 
one real solution if bض%‎ -4ac > 0. So we need to calculate P(b-4ac > 
0). We will use three jointly distributed random variables to do this. 1 ي‎ 1 
First, since 0 < a,b,c < 1, we have ۱ MM c= 3 
b”-4ac > 0 > 0 < 4ac < b% <1 © 0 < 2VavVc <b <1, R,' ie 
0 1 1 
1 1 
where the last relation holds for all 0 < a,c < 1 such that 4 
1 Figure 3.7.1 Region 
DT R=R,UR; 
a 


Considering a, b and c as real variables, the region R in the ac-plane where the above 
relation holds is given by R = {(a,c):0<a<1l, 0<c<1l, 0<c< 3}, which we can see is a 
union of two regions R, and R,;, as in Figure 3.7.1 above. 

Now let X, Y and Z be continuous random variables, each representing a randomly se- 
lected real number from the interval (0, 1) (think of X, Y and Z representing a, b and c, 
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respectively). Then, similar to how we showed that f(x) = 1 is the p.d.f. of the uniform dis- 
tribution on (0, 1), it can be shown that f(x, y,z) = 1 for x, y, z in (0,1) 
(0 elsewhere) is the joint p.d.f. of X, Y and Z. Now, 


P(b-4ac>0) = P((a,c)JeR, AVavcsb<1), 


so this probability is the triple integral of f(a, b,c) = 1 as b varies from 27a ce to 1 and as 
(a,c) varies over the region R. Since R can be divided into two regions R, and R;, then the 
required triple integral can be split into a sum of two triple integrals, using vertical slices in 
R: 


1/4 ^1 r1 1 1/4a r1 
P(b-4ac >0) 0 1 1 1dbdcda >| 1 1 1dbdcda 
0 0 J2VavVce 1/4 J0 27a Vc 


R1 R2 


1/4 n1 1 r1/4a 

1 (1- 2Va Vc)dcda + 1 (1-2Va Vc)dcda 
0 0ل‎ 1/40 
1/4 c=1 1 c=1/4 

= (e-fvae| | aa | (- fa ° | 
0 =0 1/4 


=0 
1/4 1 1 
1-3Va da + | 3-da 
j a-fvmae+ f B 


1/4 1 


> 2 + —]na 
9 0 12 1/4 
1 1 1 1 5 1 
ج‎ 1 + In 1 ک‎ + In4 
4 9 12 4 36 12 
5+3ln4 
P(b-4qac >0) = س‎ ~ı 0.254 


In other words, the equation ax + bx + c = 0 has about a 25% chance of being solved! 


Expected Value 


The expected value EX of a random variable X can be thought of as the “average” value 
of X as it varies over its sample space. If X is a discrete random variable, then 


EX = Y xP(X =»), (3.50) 


with the sum being taken over all elements x of the sample space. For example, if X repre- 
sents the number rolled on a six-sided die, then 


6 6 
1 
EX = JY xP(X =x) = 6 = 85 (3.51( 
x=1 


x=1 


is the expected value of X, which is the average of the integers 1 — 6. 
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If X is a real-valued continuous random variable with p.d.f. f, then 
OO 
EX = 1 xf(x)dx . (3.52) 
—0oO 


For example, if X has the uniform distribution on the interval (0, 1), then its p.d.f. is 


1, forO0<x<1 
() = 4 (3.53) 
أ‎ elsewhere, 


and so 
1 


oo 1 
EX = ا‎ xf(x)(dx = xdx = 9° (3.54) 
-00 0 


For a pair of jointly distributed, real-valued continuous random variables X and Y with 
joint p.d.f f(x,y), the expected values of X and Y are given by 


EX | 1 xf(x,yJ)dxdy and EY ت‎ 1 yf(xr,y)dxdy , (3.55) 
respectively. 


Example 3.18. If you were to pick n > 2 random real numbers from the interval (0, 1), what 
are the expected values for the smallest and largest of those numbers? 


Solution: Let U,,...,U, be n continuous random variables, each representing a randomly 
selected real number from (0, 1), i.e. each has the uniform distribution on (0, 1). Define 
random variables X and Y by 


X =min(U,,...,UÜ,) and Y =max(U,,..., Uy). 
Then it can be shown* that the joint p.d.f. of X and Y is 


n(n-1)(y-x)"™ 2, forOsxsys1 


f)e,y( = | (3.56) 


0, elsewhere. 


Thus, the expected value of X is 


1 r1 
EX =| 1 n(n- Dx(y-x)" 2 dydx 
0 Jx 


1 2 
= (rao | | 
0 =» 


1 
1 nx(1—x)™ dx , so integration by parts yields 
0 


1 
0 


1 
-)1- س"(‎ )1- (1 
n+1 
1 
n+1 


4See Ch. 6 in HOEL, PORT and STONE. 
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and similarly (see Exercise 3) it can be shown that 


1 
EY = | na- Dy(y-o) dxdy = ` 
0 Jo n+1 
So, for example, if you were to repeatedly take samples of n = 3 random real numbers from 
(0, 1), and each time store the minimum and maximum values in the sample, then the aver- 
age of the minimums would approach 1 and the average of the maximums would approach 
as the number of samples grows. It would be relatively simple (see Exercise 4) to write a 


computer program to test this. 


Exercises 
B 


1. Evaluate the integral JS, e dx using anything you have learned so far. 
1 -)-ر‎ 0/20 
2. For o >0 and u >0, evaluate fS, ze 20 qx. 


3. Show that EY = & in Example 3.18 
C 


4. Write a computer program (in the language of your choice) that verifies the results in 
Example 3.18 for the case n = 3 by taking large numbers of samples. 


5. Repeat Exercise 4 for the case when n =4. 


6. For continuous random variables X, Y with joint p.d.f f(x, y), define the second moments 
E(X) and E(Y) by 


E(x) = 2 f(r, dxdy and E(¥®) = lL f(r,Wdxdy, 
and the variances Var(X) and Var(Y ) by 
Var(X) = E(X)-(EXY and Var(Y) = E(Y)-(EY) . 
Find Var(X) and Var(Y) for X and Y as in Example 3.18. 


1. Continuing Exercise 6, the correlation p between X and Y is defined as 
_ E(XY)-(EXXEY) 
VVar(X)Var) 
where E(XY)= fS J opxyf(x,y)dxdy. Find p for X and Y as in Example 3.18. 
(Note: The quantity E(XY)-(EX)((EY) is called the covariance of X and Y.) 


8. In Example 3.17 would the answer change if the interval (0, 100) is used instead of (0, 1)? 
Explain. 


4 Line and Surface Integrals 


4.1 Line Integrals 


In single-variable calculus you learned how to integrate a real-valued function f(x) over an 
interval [a,b] in R'. This integral (usually called a Riemann integral) can be thought of as 
an integral over a path in R, since an interval (or collection of intervals) is really the only 
kind of “path” in R'. You may also recall that if f(x) represented the force applied along the 
x-axis to an object at position x in [a,b], then the work W done in moving that object from 
position x = a to x = b was defined as the integral: 


b 
Ww = | f(x)dx 


In this section, we will see how to define the integral of a function (either real-valued or 
vector-valued) of two variables over a general path (i.e. a curve) in R2. This definition will 
be motivated by the physical notion of work. We will begin with real-valued functions of two 
variables. 

In physics, the intuitive idea of work is that 


Work = Force x Distance . 


Suppose that we want to find the total amount W of work done in moving an object along a 
curve C in R with a smooth parametrization x = x(t), y = y(t), a = t = b, with a force f(t, y) 
which varies with the position (x, y) of the object and is applied in the direction of motion 
along C (see Figure 4.1.1 below). 


0 
Figure 4.1.1 Curve C:x = x(t),y = y(t) for t in [a,b] 


We will assume for now that the function f(x, y) is continuous and real-valued, so we only 
consider the magnitude of the force. Partition the interval [a, b| as follows: 


a=to <tı <a <°**<,-ı <, = b , for some integer n > 2 
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As we can see from Figure 4.1.1, over a typical subinterval [t;,t;,,] the distance As; traveled 


along the curve is approximately ı/ Ax; + Ay;7, by the Pythagorean Theorem. Thus, if the 
subinterval is small enough then the work done in moving the object along that piece of the 
curve is approximately 


Force x Distance x f(%;.,7y;.)V Ax; + Ay; , (4.1( 


where (%;., Y;.) = (x(t;*), y(t;*)) for some t;* in [t;,t;,,], and so 


n-1 
W x 3 f (x: Y;.)/ Ax; + Ay; (4.2) 
1=0 


is approximately the total amount of work done over the entire curve. But since 


( 
Ax; + Ay; 2| ك‎ At, , 
where At; = t;,, —t;, then 
W x >f )ا‎ AE (4.3) 
Xix #9 J 1* E6 . 
YN a: At, 


Taking the limit of that sum as the length of the largest subinterval goes to 0, the sum over 
all subintervals becomes the integral from t = a to t = b, i and i i become x(t) and y(t), 
respectively, and f(%;,, y;,) becomes f (x(t), y(t)), so that 


b 
= 1 ESSE EE dE 4.4) 


The integral on the right side of the above equation gives us our idea of how to define, 
for any real-valued function f(x,y), the integral of f(x,y) along the curve C, called a line 
integral: 


Definition 4.1. For a real-valued function f(x,y) and a curve C in R, parametrized by 
x = x(t), y = y(t), a <=t = b, the line integral of f(x, y) along C with respect to arc length 


8S is 
b 
1 ا‎ e (4.5) 


The symbol ds is the differential of the arc length function 


1 
8 = ا = ( )ی‎ \/x(u)2 + y'(u)2 du , (4.6( 
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which you may recognize from Section 1.9 as the length of the curve C over the interval [a,t], 


for all t in [a,b]. That is, 
ds = s(t)dt = J x'()2+ y'(D2 dt , (4.7) 


by the Fundamental Theorem of Calculus. 

For a general real-valued function f(x, y), what does the line integral Je f(x,y)ds rep- 
resent? The preceding discussion of ds gives us a clue. You can think of differentials as 
infinitesimal lengths. So if you think of f(x, y) as the height of a picket fence along C, then 
f(x,y)ds can be thought of as approximately the area of a section of that fence over some 
infinitesimally small section of the curve, and thus the line integral J/g f(x,y)ds is the total 
area of that picket fence (see Figure 4.1.2). 


Xx 


> 


0 
Figure 4.1.2 Area of shaded rectangle = height x width x f(x, y)ds 


Example 4.1. Use a line integral to show that the lateral surface area A of a right circular 
cylinder of radius r and height A is 2rrh. 


Solution: We will use the right circular cylinder with base circle C 
given by x + y7 = r and with height A in the positive z direction 
(see Figure 4.1.3). Parametrize C as follows: 


x = x(t) = rcost, Yy = y(t) = rsint, Osts2r = f(,۷( 


Let f(x,y) = h for all (x, y). Then 


A = [fends = | f (x(t), y(t) ¥ (E) + y(t) dt 


2r 
=| hv (-rsint)® + (rcost)® dt 
0 
2r 
n rV sin t+ cost dt 
0 


2r 
rh | 1dt = 2rrh 
0 


C:47 + ر‎ = 


Figure 4.1.3 


1 
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Note in Example 4.1 that if we had traversed the circle C twice, i.e. let t vary from 0 to 
Ar, then we would have gotten an area of 4rrh, i.e. twice the desired area, even though the 
curve itself is still the same (namely, a circle of radius r). Also, notice that we traversed the 
circle in the counter-clockwise direction. If we had gone in the clockwise direction, using the 
parametrization 


x = x(t) = rcos(2r—t), y = y(t) = rsin(Ar—-t), Osts2r, (4.8) 


then it is easy to verify (see Exercise 12) that the value of the line integral is unchanged. 

In general, it can be shown (see Exercise 15) that reversing the direction in which a curve 
C is traversed leaves f, f(x,y)ds unchanged, for any f(x, y). If a curve C has a parametriza- 
tion x = x(t), y = y(t), a = t <= b, then denote by —C the same curve as C but traversed in the 
opposite direction. Then —C is parametrized by 


x = x(a+b-t), y = y(a+b-t), astsb, (4.9) 


and we have 


|fenas a f(xr,y)ds . (4.10( 
EC -C 


Notice that our definition of the line integral was with respect to the arc length parameter 
s. We can also define 


b 
[fena ا‎ f (x(D,y(t)x (OD dt (4.11( 


as the line integral of f(x, y) along C with respect to x, and 


b 
٣ f(x,y)dy = 1 f(D, y(t)y(Ddt (4.12( 


as the line integral of f(x,y) along C with respect to Y. 

In the derivation of the formula for a line integral, we used the idea of work as force 
multiplied by distance. However, we know that force is actually a vector. So it would be 
helpful to develop a vector form for a line integral. For this, suppose that we have a function 
f(x,y) defined on R by 


f(x,y) = P(x,y)i + Q(x, 9)j 
for some continuous real-valued functions P(x,y) and Q(x,y) on R2. Such a function f is 


called a vector field on RJ. It is defined at points in R, and its values are vectors in R2. For 
a curve C with a smooth parametrization x = x(t), y = y(t), a <t <= b, let 


r(t) = x(t)i + y()j 
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be the position vector for a point (x(t), y(t)) on C. Then r'(t) = x'(t)i+ y'(t)j and so 


b b 
[Pena 4 |8» = | P(x(t),y(t))x (Ddt + | Q((D,y(D)y (O dt 
b 
ت‎ 1 (PCD, y(t) (D+ QOD, (Dy (O) dt 
b 
ا‎ f(D), y(t)“ (Ddt 


by definition of f(x, y). Notice that the function f(<(t), y(t))-r'(t) is a real-valued function on 
[a,b], so the last integral on the right looks somewhat similar to our earlier definition of a 
line integral. This leads us to the following definition: 


Definition 4.2. For a vector field f(x,y) = P(x,y)i+ Q(x,y)] and a curve C with a smooth 
parametrization x = x(t), y = y(t), a = t = b, the line integral of f along C is 


far = | Perna + 1 Q(x,y)dy (4.13)‏ ا 
C C C‏ 
b‏ 
fed)-r'at, (4.14)‏ | = 
da‏ 
where r(t) = x(t)i + y(t)] is the position vector for points on C.‏ 


We use the notation dr = r'(t)dt = dxi+ dyj to denote the differential of the vector-valued 
function r. The line integral in Definition 4.2 is often called a line integral of a vector field 
to distinguish it from the line integral in Definition 4.1 which is called a line integral of a 
scalar field. For convenience we will often write 


| Pena + |8» ڪِ‎ |Penar+ Qend, 


where it is understood that the line integral along C is being applied to both P and Q. The 
quantity P(x, y)dx + Q(x, y)dy is known as a differential form. For a real-valued function 
F(x, y), the differential of F is dF = 9E dx+ dy. A differential form P(x, y)dx + Q(x, y)dy 
is called exact if it equals QF" for some function F(x, y). 

Recall that if the points on a curve C have position vector r(t) = x(t)i+ y(t)j, then r'’(t) is a 
tangent vector to C at the point (x(t), y(t)) in the direction of increasing t (which we call the 
direction of C). Since C is a smooth curve, then r’(t) ¥ 0 on [a,b] and hence 


r'(t) 


T(L) = 
2 = r] 


is the unit tangent vector to C at (x(t), y(t)). Putting Definitions 4.1 and 4.2 together we get 
the following theorem: 
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Theorem 4.1. For a vector field f(x,y) = P(x,y)i+ Q(x,y)] and a curve C with a smooth 
parametrization x = x(t), y = y(t), a <t <= b and position vector r(t) = x(t)i+ y(t), 


| tar = |tTas, (4.15) 
C C 


where T(t) = r is the unit tangent vector to C at (x(t), y(t)). 


lr 


If the vector field f(x, y) represents the force moving an object along a curve C, then the work 
W done by this force is 


W = | Tas = far. (4.16) 
C C 


Example 4.2. Evaluate )مل‎ + y)dx + 2xydy, where: 
(a) C:x=t, y=, OstS1 


(b) C:x=t, y=2, Osts1 


Solution: Figure 4.1.4 shows both curves. ۷ )12( 
(a) Since x(t) = 1 and y'(t) = 2, then 


1 
(x + y dx + Ixy dy = (((t) + y(t))x (E) + 2x()y(t)y'(t)) dt 
0 


1 
1 (( + 412)(1)+ 2(21)(2)) dt j 3 

Figure 4.1.4‏ ا 
dt‏ غ13 1 

0 
_ 138| _ 83 

3 l0 3 


(b) Since x(t) = 1 and y'(t) = 4t, then 


1 
(x + y dx + Ixy dy = 1 ((e(t) + y(t)%)x (E) + 2x()y(t)y'(t)) dt 
0 


1 
((t + 4*)(1) + 2(2)(4t)) dt 
0 
1 
= (t + 20t%)dt 
0 
3 1 
> ٣ ا = کرړ ب‎ 
3 ر‎ 3 3 
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So in both cases, if the vector field f(x, y) = (2 + y2)i+ 2xyj represents the force moving an 
object from (0,0) to (1,2) along the given curve C, then the work done is 3. This may lead 
you to think that work (and more generally, the line integral of a vector field) is independent 
of the path taken. However, as we will see in the next section, this is not always the case. 


Although we defined line integrals over a single smooth curve, if C is a piecewise smooth 
curve, that is 
CEC UC UC, 


is the union of smooth curves C,,...,C,, then we can define 
1 fdr = 1 fdr, + ا‎ f‘dr, +...+ 1 fdr, 
C C1 C2 Cn 
where each r; is the position vector of the curve C:. 


Example 4.3. Evaluate م‎ + y)dx + 2xydy, where C is the polygonal path from (0, 0) to 
(0,2) to (1,2). 


Solution: Write C = Cı, UC», where C, is the curve given by x =0, y =t, 
0 =t=<2 and C, is the curve given by x =t, y =2, 0 st = 1 (see Figure 
4.1.5). Then 


2 


| 2+92 a+ 2y dy ك‎ (2 + y dx + 2y dy 
C C1 


+ 1 (2 + y dx + 2xydy 
C2 


Figure 4.1.5 


2 1 
= ((0 + t*)(0) + 2(0)t(1)) dt + ((t + 4)(1) + 2t(2)(0)) dt 
0 0 


2 1 
اك‎ oat+ | (+4 dt 
0 0 
3 1 183 
= — +4 = +4 = 
3 ر‎ 3 


Line integral notation varies quite a bit. For example, in physics it is common to see the 
notation f f-d1, where it is understood that the limits of integration a and b are for the 
underlying parameter t of the curve, and the letter 1 signifies length. Also, the formulation 
Jef: Tds from Theorem 4.1 is often preferred in physics since it emphasizes the idea of 
integrating the tangential component fT of fin the direction of'T (i.e. in the direction of C), 
which is a useful physical interpretation of line integrals. 
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Exercises 


A 
For Exercises 1-4, calculate fe f(x, y)ds for the given function f(x,y) and curve C. 


1. f(x,y)=xy; C:x=cost, y= sint, Ost <s r/2 


2. f )e,ر(‎ = س‎ CxS y=0, 0S1 51 
x 


+ 
3. f(x,yJ)=2x+y; C: polygonal path from (0, 0) to (3,0) to (3,2) 


4. f(x,y)=x+y; C: path from (2,0) counterclockwise along the circle x + y% = 4 to the 
point (2,0) and then back to (2,0) along the x-axis 


5. Use a line integral to find the lateral surface area of the part of the cylinder 
x + y = 4 below the plane x + 2y + z = 6 and above the xy-plane. 


For Exercises 6-11, calculate /ç f‘ dr for the given vector field f(x, y) and curve C. 
6. f(x,y)=1i—-]j;j C:x=3t,y=2t,0sts1 

1. f(x,y)= yi-x]j; C:x=cost, y= sint, Ost s2r 

8. f(x,y)=xi+ yj; C:x=cost, y= sint, Ost <= 2r 

9. f(x,y) = (x? —- y)i+(x—y2)j; C:x=cost, y= sint, Ost < 2r 

10. f(x,y)=xy2i+xy3j; C : the polygonal path from (0,0) to (1,0) to (0, 1) to (0,0) 
11. f(x,y)= (x+y); C:x=2+ cost, y= sint, O St < 2r 

B 


12. Verify that the value of the line integral in Example 4.1 is unchanged when using the 
parametrization of the circle C given in formulas (4.8). 


13. Show that if f Lr'(t) at each point r(t) along a smooth curve C, then Jof‘dr =0. 


14. Show that if f points in the same direction as r'’(t) at each point r(t) along a smooth 
curve C, then Jof‘dr = Jollfll ds. 


C 
15. Prove that Joe f(x,y)ds = f_of(x,y)ds. (Hint: Use formulas (4.9).) 


16. Let C be a smooth curve with arc length L, and suppose that f(x, y) = P(x, y)i+ Q(x, y)] 
is a vector field such that |lf(x, y)|l| <> M for all (x, y) on C. Show that 
|Jef‘dr| <= ML. (Hint: Recall that ا‎ g()dx| = HE) dx for Riemann integrals.) 


17. Prove that the Riemann integral f(x)dx is a special case of a line integral. 
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4.2 Properties of Line Integrals 


We know from the previous section that for line integrals of real-valued functions (scalar 
fields), reversing the direction in which the integral is taken along a curve does not change 
the value of the line integral: 


| renas= | f(x,y)ds (4.17( 
Cc -C 


For line integrals of vector fields, however, the value does change. To see this, let f(x,y) = 
P(x,y)i+ Q(x,y)] be a vector field, with P and Q continuously differentiable functions. Let 
C be a smooth curve parametrized by x = x(t), y = y(t), a = t = b, with position vector r(t) = 
x(t)i+ y(t)j (we will usually abbreviate this by saying that C :r(t) = x(t)i+ y(t)]j is a smooth 
curve). We know that the curve —C traversed in the opposite direction is parametrized by 
x= x(a +b -—-t), y = y(a +b —t), a <t < b. Then 


b 
1 P(x,y)dx = 1 P(x(a +b —-t),y(a + b—t)) (ta +b-t)dt 
-C a 
b 
> ا‎ P(x(a +b -t),y(a + b-t))(-x'(a + b-t))dt (by the Chain Rule) 


Penns u)cau) (by letting u =a +b —t)‏ | ت 


BO DE GD 


b b 
-| P(x(u),y(u))x'(u)du , since j -- | , SO 
1 P(x,y)dx = | P(x,y)dx 

-C C 


since we are just using a different letter (u) for the line integral along C. A similar argument 
shows that 


1 Qe y)dy = = 1 Qad, 
-C C 


1 fdr 
-€C 


and hence 


1 Peds+ | Q(x,y)dy 

-C -C 

-| P(e )ds+ = | Q(x,Y)dy 
C C 


-( Pee,sdx+ | Qe.) 

C C 

| fdr. (4.18) 
C 


1 fdr 
-C 
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The above formula can be interpreted in terms of the work done by a force f(x, y) (treated 
as a vector) moving an object along a curve C: the total work performed moving the object 
along C from its initial point to its terminal point, and then back to the initial point moving 
backwards along the same path, is zero. This is because when force is considered as a vector, 
direction is accounted for. 

The preceding discussion shows the importance of always taking the direction of the curve 
into account when using line integrals of vector fields. For this reason, the curves in line 
integrals are sometimes referred to as directed curves or oriented curves. 

Recall that our definition of a line integral required that we have a parametrization x = 
x(t), y = y(t), a <= t <= b for the curve C. But as we know, any curve has infinitely many 
parametrizations. So could we get a different value for a line integral using some other 
parametrization of C, say, x = %(u), y = J(u), c su <s d ? If so, this would mean that our 
definition is not well-defined. Luckily, it turns out that the value of a line integral of a 
vector field is unchanged as long as the direction of the curve C is preserved by whatever 
parametrization is chosen: 


Theorem 4.2. Let f(x,y) = P(x,y)i+ Q(x,y)] be a vector field, and let C be a smooth curve 
parametrized by x = x(t), y = y(t), a < t = b. Suppose that t = a(u) for c <u < d, such that 
a = a(c), b = a(d), and a'(u) > 0 on the open interval (c,d) (i.e. a(u) is strictly increasing on 
[c,d1). Then Jc fdr has the same value for the parametrizations x = x(), y = y(t), a = t = b 
and x = (u) = x(a(u)), y = J(u) = y(a(u)), ce su < d. 


Proof: Since «a(u) is strictly increasing and maps [c,d] onto [a,b], then we know that t = 
a(u) has inverse function u = a7(t) defined on [a,b] such that c = a7l(a), d = a7 I(b), 
and d4 = e. Also, dt = a'(u)du, and by the Chain Rule 


x! ك‎ dê _ کک‎ dx dt س‎ / / ۹ £ (u) 
& (u) = ا‎ ea) = E (a (u) > x(t) = 0 


so making the susbstitution t = a(u) gives 


£ 


«a *(b) %' (u) 


b 
| Pecmyens'(nat = f“ Poatun,stau) 
a «7 (۾)1‎ a'(u) 


d 
1 P(R(u),F(u)x(u)du , 


(a'(u)du) 


which shows that fo P(x,y)dx has the same value for both parametrizations. A similar 
argument shows that f, Q(x, y)dy has the same value for both parametrizations, and hence 
Jef: dr has the same value. QED 


Notice that the condition a'(u) > 0 in Theorem 4.2 means that the two parametrizations 
move along C in the same direction. That was not the case with the “reverse” parametriza- 
tion for —C: for u = a +b —t we have t= a(u)= a+b -—u > a'(u)= —-1 <0. 


4.2 Properties of Line Integrals 145 


Example 4.4. Evaluate the line integral Jo + y)dx + 2xydy from Example 4.2, Section 
4.1, along the curve C:x=t, y= 2t, 0<t <1, where t= sinu for 0 <u </2. 


Solution: First, we notice that 0 = sinO, 1 = sin(r/2), and = cosu > 0 on (0,71/2). So by 
Theorem 4.2 we know that if C is parametrized by 


x=sinu, y= 2sinîu, Osu sr/2 


then )مل‎ + y2)dx+2xydy should have the same value as we found in Example 4.2, namely 
18, And we can indeed verify this: 


r/2 
ا‎ (2 + y2 )dx +2xydy = ((sin u + (2sinu)*)cosu + 2(sinu)(2 sin u)4sin u cos u) du 
C 0 


r/2 
ج‎ (sin u + 20sin*u) cosu du 
0 


sin u 8 
+4sin u 
3 0 
1 1 
= +4 = 
3 3 


In other words, the line integral is unchanged whether t or u is the parameter for C. 


By a closed curve, we mean a curve C whose initial point and terminal point are the 
same, i.e. for C:x = x(t), y = y(t), a <=t = b, we have (x(a), y(a)) = (x(b), y(b)). 


C € 
(a) Closed (b) Not closed 


Figure 4.2.1 Closed vs nonclosed curves 
A simple closed curve is a closed curve which does not intersect itself. Note that any 


closed curve can be regarded as a union of simple closed curves (think of the loops in a figure 
eight). We use the special notation 


{fends and {far 
C C 


to denote line integrals of scalar and vector fields, respectively, along closed curves. In some 
older texts you may see the notation f or (D to indicate a line integral traversing a closed 


curve in a counterclockwise or clockwise direction, respectively. 


146 CHAPTER 4. LINE AND SURFACE INTEGRALS 


So far, the examples we have seen of line integrals (e.g. Example 4.2) have had the same 
value for different curves joining the initial point to the terminal point. That is, the line 
integral has been independent of the path joining the two points. As we mentioned before, 
this is not always the case. The following theorem gives a necessary and sufficient condition 
for this path independence: 


Theorem 4.3. In a region R, the line integral Jo f° dr is independent of the path between 
any two points in R if and only if $, f‘ dr = 0 for every closed curve C which is contained in 
E. 


Proof: Suppose that f, f‘ dr = 0 for every closed curve C which is contained in R. Let P, 
and P, be two distinct points in AR. Let C, be a curve in R going from P, to P,, and let C, 
be another curve in R going from P, to P,», as in Figure 4.2.2. 


Then Cڇ‎ = C,U-—C, is a closed curve in R (from P, to CC 
P,), and so $f‘ dr = 0. Thus, 
0 = {far P, P» 
C 
ّ fdr + ا‎ fdr 
C1 C2 Ca 
Figure 4.2.2 


5 fdr — | f‘dr , and so 
C1 C2 
Je, fdr = fe, f: dr. This proves path independence. 
Conversely, suppose that the line integral /« fdr is independent of the path between any 
two points in R. Let C be a closed curve contained in R. Let P, and P, be two distinct points 
on C. Let C, be a part of the curve C that goes from P, to P,, and let C, be the remaining 
part of C that goes from P, to P,, again as in Figure 4.2.2. Then by path independence we 
have 


fdr 


0 
a 
1 


C1 C2 


mh 
AQ 
۳ 
| 
بسچ‎ 
س‎ 
Q 
ب‎ 
1 
° 


mh 
a 
E 
4 
9 
a 
E 
1 
o 
n 
o 


since C= C,U-—-C; . QED 
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Clearly, the above theorem does not give a practical way to determine path independence, 
since it is impossible to check the line integrals around all possible closed curves in a region. 
What it mostly does is give an idea of the way in which line integrals behave, and how seem- 
ingly unrelated line integrals can be related (in this case, a specific line integral between 
two points and all line integrals around closed curves). 

For a more practical method for determining path independence, we first need a version 
of the Chain Rule for multivariable functions: 


Theorem 4.4. (Chain Rule) If z = f(x,y) is a continuously differentiable function of x and 
y, and both x = x(t) and y = y(t) are differentiable functions of t, then z is a differentiable 


function of t, and 
dz  dzdx dzdy 


du a 


at all points where the derivatives on the right are defined. 


(4.19) 


The proof is virtually identical to the proof of Theorem 2.2 from Section 2.4 (which uses the 
Mean Value Theorem), so we omit it." We will now use this Chain Rule to prove the following 
sufficient condition for path independence of line integrals: 


Theorem 4.5. Let f(x,y) = P(x,y)i+ Q(x,y)j be a vector field in some region R, with P and 
Q continuously differentiable functions on R. Let C be a smooth curve in RR parametrized 
by x = x(t), y = y(t), a < t <= b. Suppose that there is a real-valued function F(x, y) such that 
VF =f on R. Then 


1 f‘dr = F(B) - F(A), (4.20) 
C 


where A = (x(a), y(a)) and B = (x(b), y(b)) are the endpoints of C. Thus, the line integral is 
independent of the path between its endpoints, since it depends only on the values of F at 
those endpoints. 


Proof: By definition of Jo f‘ dr, we have 


b 
a= ا‎ (PCD, y(t) (+ Q(t), y()y'() dt 


b 
=| + 2) dt (since VF =f > ad ر(‎ 
a \Ox dt dy dt dx dy 


b 
> 1 F'(x(t), y(t))dt (by the Chain Rule in Theorem 4.4) 
b 
- F(t), (t)| = F(B) - F(A) 


by the Fundamental Theorem of Calculus. QED 


1See TAYLOR and MANN, § 6.5. 
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Theorem 4.5 can be thought of as the line integral version of the Fundamental Theorem 
of Calculus. A real-valued function F(x, y) such that VF(x, y) = f(x, y) is called a potential 
for f. A conservative vector field is one which has a potential. 


Example 4.5. Recall from Examples 4.2 and 4.3 in Section 4.1 that the line integral Je + 
y)dx + 2xydy was found to have the value 1 for three different curves C going from the 
point (0, 0) to the point (1,2). Use Theorem 4.5 to show that this line integral is indeed path 
independent. 


Solution: We need to find a real-valued function F(x, y) such that 


and =2 .‏ ا 

dx dy 
Suppose that 9E = x2 + y, Then we must have F(x, y) = ڈ3‎ + xy + g(y) for some function 
g(y). So = 2xy + g'(y) satisfies the condition . = 2xy if g'(y) = 0, i.e. g(y) = K, where K 
is a constant. Since any choice for K will do (why?), we pick K = 0. Thus, a potential F(x, y) 
for f(x,y) = (x2 + y)i+ 2xyj exists, namely 


1 
Fay) = g1 +. 


Hence the line integral J + y)dx + 2xy dy is path independent. 
Note that we can also verify that the value of the line integral of f along any curve C going 
from (0,0) to (1,2) will always be 8, since by Theorem 4.5 

13 


| far = F)1,2) - F(0,0( = ۔ ا ا = (0+ 0 -(2 0 + 3ر1‎ 
C 3 3 3 


A consequence of Theorem 4.5 in the special case where C is a closed curve, so that the 
endpoints A and B are the same point, is the following important corollary: 


Example 4.6. Evaluate xdx+ ydy for C:x = 2cost, y= 3sint, Ost <27. 
C 


Solution: The vector field f(x,y) = xi+ yj has a potential F(x, y): 
OF 1 

— =× < ۴),y( = - 2 + ر )ع‎ ( , S0 
dx 2 


OF 1 
ر=‎ < عا)ر(=yر‎ < ع)y(‎ = > +K 
dy 2 
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1 1 
for any constant K, so F(x, y) = و‎ + y is a potential for f(x, y). Thus, 


f xdx+ydy > {far = 0 
C C 


N 


by Corollary 4.6, since the curve C is closed (it is the ellipse + . ك‎ 1(; 


Exercises 


Evaluate f (® + yd + 2y dy for C:x = cost, y= sint, Ost < 27. 
C 


Evaluate | (+ yds + 2xy dy for C:x = cost, y= sint, Ost sr. 
C 


.„ Is there a potential F(x, y) for f(x, y) = yi— x3? If so, find one. 
„ Is there a potential F(x, y) for f(x,y) = xi— y3? If so, find one. 


„. Is there a potential F(x, y) for f(x,y) = xyi+ x3 yj? If so, find one. 


. Let f(x,y) and g(x, y) be vector fields, let a and b be constants, and let C be a curve in RÎ. 


Show that 
|atsbe-ar ت‎ 3 fdr + 3 g-dr. 
C C C 


. Let C be a curve whose arc length is L. Show that f, 1ds = L. 


. Let f(x,y) and g(x, y) be continuously differentiable real-valued functions in a region R. 


Show that 
{fva.ar ب‎ 4 gVf ‘dr 
C C 


for any closed curve C in R. (Hint: Use Exercise 21 in Section 2.4.) 


4 


. Let f(x,y) = gyi + RIE for all (x,y) Z (0,0), and C: x = cost, y = sint, O0 = t < 2r. 


x2 +y 


(a) Show that f= VF, for F(x, y) = tan H(y/x). 
(b) Show that fdr = 2r. Does this contradict Corollary 4.6? Explain. 
C 


ډو صي ا0 لن 0 


C 


10. Let g(x) and h(y) be differentiable functions, and let f(x,y) = h(y)i+ g(x)j. Can f have a 


potential F(x, y)? If so, find it. You may assume that F would be smooth. (Hint: Consider 
the mixed partial derivatives of F'.) 
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4.3 Green’s Theorem 


We will now see a way of evaluating the line integral of a smooth vector field around a 
simple closed curve. A vector field f(x,y) = P(x,y)i+ Q(x, y)] is smooth if its component 
functions P(x, y) and Q(x, y) are smooth. We will use Green’s Theorem (sometimes called 
Green’s Theorem in the plane) to relate the line integral around a closed curve with a double 
integral over the region inside the curve: 


Theorem 4.7. (Green’s Theorem) Let R be a region in R whose boundary is a simple 
closed curve C which is piecewise smooth. Let f(x,y) = P(x, y)i+ Q(x, y)] be a smooth vector 
field defined on both R and C. Then 


{far = IE-5)44. (4.21( 
C 9 0x Oy 


where C is traversed so that R is always on the left side of C. 


Proof: We will prove the theorem in the case for a simple region R, that is, where the 
boundary curve C can be written as C = C, UC, in two distinct ways: 


C, = the curve y = y,(x) from the point X, to the point X» (4.22) 
C, = the curve y = y„(x) from the point X, to the point X,, (4.23) 


where X, and X, are the points on C farthest to the left and right, respectively; and 


C, = the curve x = x,(y) from the point Ys to the point Y, (4.24) 
C, = the curve x = x,(y) from the point Y, to the point Ys, (4.25) 


where Y, and Y; are the lowest and highest points, respectively, on C. See Figure 4.3.1. 


( )رر = ر 


£ = x1)y( 


T8 


ب ت ا ی ا ا و 


Figure 4.3.1 


Integrate P(x, y) around C using the representation C = C, UC, given by (4.23) and (4.24). 
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Since y = y,(x) along C, (as x goes from a to b) and y = y„(x) along C,» (as x goes from b to 
a), as we see from Figure 4.3.1, then we have 


f Pend 1 P(x,y)dx >| P(x,y)dx 
C Cı C2 


b a 
= | P(x, yı(x))dx | P(x, y2())dx 


b b 
1 BENNA 1 PESN 


b 
1 BOS BESE 


b =2) ( 
-| (Pe. | a 
a Y=71)%( 


y2) P 
. 1 ا ت‎ ——_-dydx (by the Fundamental Theorem of Calculus) 
71%) 


1 
| 
ة۶ جم 
3% 
a‏ 
> 


Likewise, integrate Q(x, y) around C using the representation C = C, UC, given by (4.25) 
and (4.26). Since x = x,(y) along C, (as y goes from d to c) and x = x„(y) along C, (as y goes 
from c to d), as we see from Figure 4.3.1, then we have 


OEE 1 Qay + 1 Qe 
C C1 C2 


c d 
OOS 1 OS 


d d 
. ا‎ OEE 1 QC), dy 


d 
ا‎ OO) AORN 
d »=2(7( 
(eel) 


x=»1(7( 
x2) 
[١ 1 ك‎ 2 dxdy (by the Fundamental Theorem of Calculus) 
x1)9( 


0 
کے‎ I a4 , and so 
dx 
R 
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f Penas+ Q(x,y)dy 
C C 


ENES 
-(E-) 


{far 
C 


QED 


Though we proved Green’s Theorem only for a simple region R, the theorem can also be 
proved for more general regions (say, a union of simple regions). 


Example 4.7. Evaluate $o + y)dx+2xy dy, where C is the boundary (traversed counter- 
clockwise) of the region R = {(x,y):0 <x <1, 22 sy < 2x}. 

Solution: R is the shaded region in Figure 4.3.2. By Green’s Theorem, for 7 2( 
P(x,y) = x2 + y7 and Q(x, y) = 2xy, we have 2 


fe + y dx + 2xydy = ا‎ C 
ك‎ e ت‎ |] a4 ا‎ 0 1 
R R 


Figure 4.3.2 
We actually already knew that the answer was zero. Recall from Example 4.5 8 ا‎ 
4.2 that the vector field f(r, y) = (x + y)i+ 2xyj has a potential function F(x, y) = ڈ3‎ + 4y, 
and so ff‘ dr = 0 by Corollary 4.6. 


Example 4.8. Let f(x, y) = P(x, y)i+ Q(x, y)j, where 


0 ر 
,ج = (ل,× P(x,y)= +> and‏ 
(t,7 E Q(x, y) ET‏ 


and let R = {(x,y):0 < x + y <1}. For the boundary curve C :x% + y% = 1, traversed counter- 
clockwise, it was shown in Exercise 9(b) in Section 4.2 that $, f‘ dr = 2r. But 


0 E E ج‎ 0. 


This would seem to contradict Green’s Theorem. However, note that R is not the entire 
region enclosed by C, since the point (0,0) is not contained in R. That is, R has a “hole” at 
the origin, so Green’s Theorem does not apply. 


2See TAYLOR and MANN, § 15.31 for a discussion of some of the difficulties involved when the boundary curve 
is “complicated”. 
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If we modify the region R to be the annulus R = j 
{(x,y) : 1/4 < x + y < 1} (see Figure 4.3.3), and take 
the “boundary” C of R to be C = C,UC,, where C, is 
the unit circle x2 + y% = 1 traversed counterclockwise 
and C, is the circle x + ر‎ = 1/4 traversed clockwise, 
then it can be shown (see Exercise 8) that 


{far =0. 
C 


We would still have Jf (¢ ب‎ 92) dA =0, so for this R 
R 


we would have 


۵ aP Figure 4.3.3 The annulus R 
fre (E) a 
C و‎ dx ûy 


which shows that Green’s Theorem holds for the annular region R. 


It turns out that Green’s Theorem can be extended to multiply connected regions, that is, 
regions like the annulus in Example 4.8, which have one or more regions cut out from the 
interior, as opposed to discrete points being cut out. For such regions, the “outer” boundary 
and the “inner” boundaries are traversed so that R is always on the left side. 


(a) Region R with one hole (b) Region R with two holes 


Figure 4.3.4 Multiply connected regions 


The intuitive idea for why Green’s Theorem holds for multiply connected regions is shown 
in Figure 4.3.4 above. The idea is to cut “slits” between the boundaries of a multiply con- 
nected region R so that R is divided into subregions which do not have any “holes”. For 
example, in Figure 4.3.4(a) the region R is the union of the regions R, and R», which are 
divided by the slits indicated by the dashed lines. Those slits are part of the boundary of 
both R, and R,, and we traverse then in the manner indicated by the arrows. Notice that 
along each slit the boundary of R, is traversed in the opposite direction as that of R,, which 
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means that the line integrals of f along those slits cancel each other out. Since R, and R, do 
not have holes in them, then Green’s Theorem holds in each subregion, so that 


fy = I (-aa ne fy, car = I (E-F)aa 


of R1 


But since the line integrals along the slits cancel out, we have 


fdr = f‘dr + f‘dr, 
e bdy bdy 


of R1 of R2 


ا ا و ا 


which shows that Green’s Theorem holds in the region RR. A similar argument shows that 
the theorem holds in the region with two holes shown in Figure 4.3.4(b). 

We know from Corollary 4.6 that when a smooth vector field f(x, y) = P(x, y)i+ Q(x, y)] on 
a region R (whose boundary is a piecewise smooth, simple closed e C) has a potential in 
R, then ef: dr = 0. And if the potential F(x, y) is smooth in R, then =P and = = Q, and 
so we know that 


and so 


oF oF OP __ 0Q 
ك‎ > —=-— inR. 
Qydx dxûy dy ûx 


Conversely, if = in R then 


free |f(-Faa = foes =o, 


For a simply connected region R (i.e. a region with no holes), the following can be shown: 


The following statements are equivalent for a simply connected region R in R®: 


(a) f(x,y) = P(x,y)i+ Q(x, y)] has a smooth potential F(x, y) in R 


(b) ا‎ fdr is independent of the path for any curve C in R 
0 


(e) fdr = 0 for every simple closed curve C in R 
(6 


OP 0 
( 5 = س‎ in A (in this case, the differential form P dx + Q dy is exact) 
x 
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Exercises 
A 


For Exercises 1-4, use Green’s Theorem to evaluate the given line integral around the curve 
C, traversed counterclockwise. 


1. feyde + 2y dy: C is the boundary of R = {(x,y):O0 Sx <1, 2x2 <y < 2x} 
C 


2. xydx+2xydy; C is the boundary of R = {(x,yJ):O0 Sx <1, sys x} 
C 


یں 


ُ 2ydx-3xdy; C is the circle x2 + y= 1 
C 


(e + y2) dx + (e +x%dy; C is the boundary of the triangle with vertices (0, 0), (4,0) 
C 
and (0,4) 


ج 


5. Is there a potential F(x, y) for f(x,y) = (y2 + 3x2)i+ 2xyj? If so, find one. 


6. Is there a potential F(x, y) for f(x,y) = (x3 cos(xcy) + 2x sin(xy))i + xycos(xy)j? If so, find 
one. 


7. Is there a potential F(x, y) for f(r, y) = (8xy + 3)i + 4(x + y)j? If so, find one. 


8. Show that for any constants a, b and any closed simple curve C, adx+bdy =0. 
C 


9. For the vector field f as in Example 4.8, show directly that f, f‘ dr = 0, where C is the 
boundary of the annulus R = {(x,y) : 1/4 < x + y < 1} traversed so that R is always on 
the left. 


10. Evaluate e” siny dx + ڈر)‎ +e” cosy)dy, where C is the boundary of the rectangle with 


vertices (1,—-1), (1, 1), (-1,1) and (-1,—-1), traversed counterclockwise. 
C 


11. For a region R bounded by a simple closed curve C, show that the area A of R is 


1 
A = f yds = f «dy = 3f sdy-ydx; 
C Cc 2 Jc 


where C is traversed so that R is always on the left. (Hint: Use Green's Theorem and the 
fact that A = Jf 1dA.) 
R 
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4.4 Surface Integrals and the Divergence Theorem 


In Section 4.1 we learned how to integrate along a curve. We will now learn how to perform 
integration over a surface in RF, such as a sphere or a paraboloid. Recall from Section 1.8 
how we identified points (x, y,z) on a curve C in R3, parametrized by x = x(t), y = y(t), z2 = z(t), 
a <t <b, with the terminal points of the position vector 


r(t) = x(t)i+ y(t)j + z(t)k for t in [a,b]. 


The idea behind a parametrization of a curve is that it “transforms” a subset of RÎ (nor- 
mally an interval [a, b]) into a curve in R or RF (see Figure 4.4.1). 


4z 
(x(t), y(t),2(£)) 
ص‎ 
x = x(t) 
y= )( ((b),y(b),z(b)) 
z= 2(8) 
RI y 
a t b 0 
xX 


Figure 4.4.1 Parametrization of a curve C in R3 


Similar to how we used a parametrization of a curve to define the line integral along the 
curve, we will use a parametrization of a surface to define a surface integral. We will use 
two variables, u and uv, to parametrize a surface Z in R3: x= x(u,U), JY = J(u,U), 2 = 2(U,U), 
for (u,v) in some region R in R (see Figure 4.4.2). 


1U R2 
1z 
2 
ص‎ 
R Xx = X(u, U) 
6 y= y)u,v( 
(u uv( Z2 = 2(Uu,U) r(u,u) 
y 
ر‎ 
u x 


Figure 4.4.2 Parametrization of a surface X in RF 


In this case, the position vector of a point on the surface Z is given by the vector-valued 
function 
r(u,u) = x(u,u)i + y(u,u)] + z(u,u)k for (u,vu)in R. 
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Since r(u,vu) is a function of two variables, define the partial derivatives 9 and o£ for 
(u,u)in R by 


( 0 
ا(0‎ © e,0) + (uyu)j + (u,yu)k , and 
u du du du 
0 0 
—)u,۷( = © e, + ^ ز(ت ,س‎ + (u,u)k . 
du du du 


The parametrization of XZ can be thought of as “transforming” a region in R (in the uu- 
plane) into a 2-dimensional surface in R3. This parametrization of the surface is sometimes 
called a patch, based on the idea of “patching” the region R onto Z in the grid-like manner 
shown in Figure 4.4.2. 

In fact, those gridlines in R lead us to how we will define a surface integral over Z. Along 
the vertical gridlines in R, the variable u is constant. So those lines get mapped to curves on 
Z, and the variable u is constant E the position vector r(u,u). Thus, the tangent vector 
to those curves at a point (u,v) i is -. Similarly, the horizontal gridlines in R get mapped to 
curves on Z whose tangent O are ge 

Now take a point (u,v) in R as, say, the lower left corner of one of the rectangular grid 
sections in R, as shown in Figure 4.4.2. Suppose that this rectangle has a small width 
and height of Au and Av, respectively. The corner points of that rectangle are (u,v), (u + 
Au,u), (u + Au,u + Au) and (u,u + Av). So the area of that rectangle is A = Au Au. Then that 
rectangle gets mapped by the parametrization onto some section of the surface Z which, 
for Au and Av small enough, will have a surface area (call it do) that is very close to the 
area of the parallelogram which has adjacent sides r(u + Au,u)—r(u,v) (corresponding to the 
line segment from (u,v) to (u + Au,u) in R) and r(u,u + Au) —r(u,u) (corresponding to the 
line segment from (u,v) to (u,u + Au) in R). But by combining our usual notion of a partial 
derivative (see Definition 2.3 in Section 2.2) with that of the derivative of a vector-valued 
function (see Definition 1.12 in Section 1.8) applied to a function of two variables, we have 


Or r(u + Au,u)—r(u,u) 


~ 5 and 

du Au 

Or _ r(u,u+ Av)-r(u,u) 

du Au 

and so the surface area element do is approximately 
Or Or Or ûr 
|| r(u + Au,u)— r(u,u)) x (r(u, u + Av) — r(u, u)) || x | (Au —) x (Au—)|j = x Au Au 

du du du du 


by Theorem 1.13 in Seo 1.4. Thus, the total surface area S of Z is approximately the sum 
of all the quantities lz 9F „ 5 ‫1 Au Av, summed over the rectangles in R. Taking the limit of 
that sum as the ا48‎ of hê largest rectangle goes to 0 gives 


ا 


ûr 
^ „ °" dudu. (4.26) 


du 
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We will write the double integral on the right using the special notation 


“= | 


2 R 


dOr ûr 
 . 


4.27 ر س 
Qu du e (‏ 


This is a special case of a surface integral over the surface Z, where the surface area element 
do can be thought of as 1do. Replacing 1 by a general real-valued function f(x, y,z) defined 
in RF, we have the following: 


Definition 4.3. Let X be a surface in R3 parametrized by x = x(u,u), yJ = y(u,u), 

z = z(u,v), for (u,v) in some region R in R2. Let r(u,u) = x(u,u)i+ y(u,v)] + z(u,u)k be the 
position vector for any point on LZ, and let f(x, y,z) be a real-valued function defined on some 
subset of RÛ that contains XZ. The surface integral of f(x, y,z) over XZ is 


dudu . (4.28) 


| Qr Or 
کی‎ 
Qu du 


|| f.s,2do f (x(u,u), y(u,uU),2(u,U)) 


22 R 


In particular, the surface area S of Z is 


S = | 14e. (4.29) 
2 


Example 4.9. Atorus Tis a surface obtained by revolving a circle of radius a in the yz-plane 
around the z-axis, where the circle’s center is at a distance b from the z-axis (0 < a < b), as 
in Figure 4.4.3. Find the surface area of T. 


= ۶ے + (ط - ر) 


e 


8 b 
(a) Circle in the yz-plane (b) Torus T 


Figure 4.4.3 
Solution: For any point on the circle, the line segment from the center of the circle to that 


point makes an angle u with the y-axis in the positive y direction (see Figure 4.4.3(a)). And 
as the circle revolves around the z-axis, the line segment from the origin to the center of that 
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circle sweeps out an angle u with the positive x-axis (see Figure 4.4.3(b)). Thus, the torus 
can be parametrized as: 


x= (b+acosu)cosvu, y=(b+acosu)sinvu, zZ=asinu, Osus2r, Osus2r 
So for the position vector 


r(u,u) = x(u,u)i + y(u,u)] + z(u,u)k 


(b +acosu)cosvi + (b +acosu)sinvuj + asinuk 


we see that 


Or 1 : 0 
— = -asinu cosvi ¬ asinu sinvuj + acosuk 
du 
Or a : 
ا‎ —(b +acosu)sinvi + (b +acosu)cosvu]j + Ok, 
U 
and so computing the cross product gives 
Or ûr . 1 
3 x a —a(b + acosu)cosu cosui — a(b +acosu)sinuv cosuj — a(b +acosu)sinuk , 
u du 


which has magnitude 
OF Or 


du Bb 


= a(b +acosu) . 


Thus, the surface area of T is 


14 || ک3 
> 
fl‏ 

2r مr2r‎ 
0 1 a(b +acosu)du du 
0 J0 
2r 2 U=2r 
ا‎ (abu +a Sinu | 
0 u=0 


2r 
2rab du 
0 


4rab 


َ 
* ہے‎ ٣ dy 


Since e and ge are tangent to the surface Z (i.e. lie in the tangent plane to Z at each point 


on Z), then their cross product ر ع9‎ ۶ o£ 


at each point of Z. Thus, 


is perpendicular to the tangent plane to the surface 
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J eo J f Ve SAAS, 
ا‎ R 


where n = 9 x or. We say that n is a normal vector to ZL. 


Recall that normal vectors to a plane can point in two opposite 
directions. By an outward unit normal vector to a surface Z, 
we will mean the unit vector that is normal to Z and points away 
from the “top” (or “outer” part) of the surface. This is a hazy 
definition, but the picture in Figure 4.4.4 gives a better idea of 
what outward normal vectors look like, in the case of a sphere. 
With this idea in mind, we make the following definition of a 
surface integral of a 3-dimensional vector field over a surface: 


Figure 4.4.4 


Definition 4.4. Let X be a surface in R and let f(x, y,2) = f(t, y,2)i+ f(t, y,2)j + f(t, y,2)k 
be a vector field defined on some subset of RF that contains XZ. The surface integral of f 


over Z is 
|| tae ڪ‎ || tae, (4.30) 
3 


2 


where, at any point on Z, n is the outward unit normal vector to Z. 


Note in the above definition that the dot product inside the integral on the right is a 
real-valued function, and hence we can use Definition 4.3 to evaluate the integral. 


Example 4.10. Evaluate the surface integral Jf fda, where f(x, y,2) = yzi+xzj + xyk and E 


is the part of the plane x+y+z = 1 with x > 0, y> 0, and z > 0, with the outward unit normal 
n pointing in the positive z direction (see Figure 4.4.5). 


Solution: Since the vector v = (1,1, 1) is normal to the plane x+y+2 =1 


(why?), then dividing v by its length yields the outward unit normal 
1 1 1 
V3 V3 V3 
from Figure 4.4.5, projecting Z onto the xy-plane yields a triangular 
region RR ={(x,y):O0 sx =1, Osys1-—-x}. Thus, using (u,v) instead of 


(x,y), we see that 


vector n = ( ) We now need to parametrize Z. As we can see 


Figure 4.4.5 


x=u, Jy=u, 2=1-(u+u), forOsus1,0sus1l-—u 
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is a parametrization of Z over R (since z= 1—-(x+ y) on EZ). So on LL, 


(ررتد + س جن ےک = ` رک ,)2,۰ = fn‏ 
V3 V3 v3 V3‏ 
1 1 
E‏ ا چ 


( اا + 1(2 + )u‏ -(ں + ا = 


V3 
for (u,v) in R, and for r(u,u) = x(u,u)i+ y(u,vu)j + z2(u,u)k = ui + vj + (1 — (u + u))k we have 


3 3 3 3 
FF a 0-D0 O,1-D = LD و‎ 
du du 


— l= ¥8. 
du v3 


Thus, integrating over R using vertical slices (e.g. as indicated by the dashed line in Figure 


4.4.5) gives 
|| tae 2 || tna 
3 


Þ2 


dudu 


و ت 


dr ûr 
du duv 


J ESS 
R 


1 ^1-u 
س‎ 1 OEE E 
RE 


u=1-u 
| 


u=0 


-[ (u + u) (u +) uv 
0ل‎ 2 3 2 


E 3 , ال‎ 
-+—---—+-—— | du 
0 6 2 2 6 


u u u3 5u® 
غر کے‎ + 
6 4 2 24 


1 


0 8 


Computing surface integrals can often be tedious, especially when the formula for the 
outward unit normal vector at each point of Z changes. The following theorem provides an 
easier way in the case when Z is a closed surface, that is, when Z encloses a bounded 
solid in R3. For example, spheres, cubes, and ellipsoids are closed surfaces, but planes and 
paraboloids are not. 
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Theorem 4.8. (Divergence Theorem) Let X be a closed surface in R3 which bounds a 
solid S$, and let f(x, y,z2) = fı(x,y,z2)i+ f(x, y,2)] + f(x, y,z)k be a vector field defined on some 
subset of R3 that contains XZ. Then 


|| tae 1 div fdV, (4.31) 
2 S 
where 


. 1 2 3 
fı f2 fs 
div f = + 5 + (4.32) 


is called the divergence of f. 


The proof of the Divergence Theorem is very similar to the proof of Green’s Theorem, i.e. it 
is first proved for the simple case when the solid S is bounded above by one surface, bounded 
below by another surface, and bounded laterally by one or more surfaces. The proof can then 
be extended to more general solids. 

Example 4.11. Evaluate J fda, where f(x,y,z) = xi+ yj + zk and Z is the unit sphere 
92 

.1= 2+ 2ر + 42 

Solution: We see that div f= 1+1+1 =3, so 


E 


3م 
3 
و = Jjj 1av‏ 8 
S‏ 


1 


I 


3 


In physical applications, the surface integral Jf fda is often referred to as the flux of f 


through the surface Z. For example, if f E the velocity field of a fluid, then the flux 
is the net quantity of fluid to flow through the surface Z per unit time. A positive flux means 
there is a net flow out of the surface (i.e. in the direction of the outward unit normal vector 
n), while a negative flux indicates a net flow inward (in the direction of —n). 

The term divergence comes from interpreting div f as a measure of how much a vector 
field “diverges” from a point. This is best seen by using another definition of div f which is 
equivalent“ to the definition given by formula (4.32). Namely, for a point (x, y,z) in RF, 


1 
iv f = lim = [| f 4. 
div f(x,y,2) ا‎ da, (4.33) 


3See TAYLOR and MANN, $ 15.6 for the details. 
4See SCHEY, p. 36-39, for an intuitive discussion of this. 
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where V is the volume enclosed by a closed surface Z around the point (x,y,z). In the 
limit, V ك‎ 0 means that we take smaller and smaller closed surfaces around (x, y,z), which 
means that the volumes they enclose are going to zero. It can be shown that this limit is 
independent of the shapes of those surfaces. Notice that the limit being taken is of the 
ratio of the flux through a surface to the volume enclosed by that surface, which gives a 
rough measure of the flow “leaving” a point, as we mentioned. Vector fields which have zero 
divergence are often called solenoidal fields. 
The following theorem is a simple consequence of formula (4.33). 


Theorem 4.9. If the flux of a vector field f is zero through every closed surface containing a 
given point, then div f= 0 at that point. 


Proof: By formula (4.33), at the given point (x,y,z) we have 


1 
div f(<,y,2) jim > j t-te for closed surfaces Z containing (x,Y,Z), SO 
2 


1 
im 7 (0) by our assumption that the flux through each Z is zero, so 


0. QED 


Lastly, we note that sometimes the notation 


f fery,2do and f f.de 
> 


2 


is used to denote surface integrals of scalar and vector fields, respectively, over closed sur- 
faces. Especially in physics texts, it is common to see simply $ instead of $. 
3 ‌ 


Exercises 


A 


For Exercises 1-4, use the Divergence Theorem to evaluate the surface integral /f fda of 
2 


the given vector field f(x, y,z) over the surface Z. 

1. f(x,y,2) = xi+ 2yj + 3zk, >: + y +2 =9 

2. f(<,y,2) = xi+ yj + zk, Z: boundary of the solid cube S = {(x,y,2):0 < x,y,z =1} 
3. f(x,y,2) = x3 1+ ز3ر‎ + 2k, 2:» + y+ 2 =1 


4. f(x,y,2) = 21+ 3j + 5k, >: + y2 +2 =1 
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. Show that the flux of any constant vector field through any closed surface is zero. 


. Evaluate the surface integral from Exercise 2 without using the Divergence Theorem, i.e. 
using only Definition 4.3, as in Example 4.10. Note that there will be a different outward 
unit normal vector to each of the six faces of the cube. 


. Evaluate the surface integral Jf f“ do, where f(x,y,2) = xi + xyj + zk and Z is the part of 


2 
the plane 6x + 3y + 2z = 6 with x > 0, y > 0, and z > 0, with the outward unit normal n 
pointing in the positive z direction. 


„ Use a surface integral to show that the surface area of a sphere of radius r is 4rr2. (Hint: 
Use spherical coordinates to parametrize the sphere.) 


. Use a surface integral to show that the surface area of a right circular cone of radius R 
and height h is rR VA + R2. (Hint: Use the parametrization x = rcosÛO, y =r sinO, z = Ar, 
for OsrsR and 0<0 <2.) 


2 
10. The ellipsoid + و‎ + = 1 can be parametrized using ellipsoidal coordinates 
x = asinQ cosO0 , y= bsinp sinO0, z=ccosp, for Os0 <27 and Os Pps. 


Show that the surface area S of the ellipsoid is 
IT 2r 
= 1 1 sin pV a2b2 cos p + c2(a sin 0 + bcos 0)sin p dO dQ . 
0 J0 


(Note: The above double integral can not be evaluated by elementary means. For specific 
values of a, b and c it can be evaluated using numerical methods. An alternative is to 
express the surface area in terms of elliptic integrals.5) 


11. Use Definition 4.3 to prove that the surface area S over a region R in R of a surface 
z = f(x,y) is given by the formula 


8 = Ive) a4. 


(Hint: Think of the parametrization of the surface.) 


5BOWMAN, F., Introduction to Elliptic Functions, with Applications, New York: Dover, 1961, § II1.7. 
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4.5 Stokes’ Theorem 


So far the only types of line integrals which we have discussed are those along curves in R. 
But the definitions and properties which were covered in Sections 4.1 and 4.2 can easily be 
extended to include functions of three variables, so that we can now discuss line integrals 
along curves in RÎ. 

Definition 4.5. For a real-valued function f(<,y,z) and a curve C in RF, parametrized by 


x = x(t), y = y(t), z = z(t), a < t = b, the line integral of f(x,y,z) along C with respect to 
arc length s is 


b 
|fess.nas = Û FODID,AN) r + y+ 210 dt. (4.34) 


The line integral of f(x, y,z) along C with respect to x is 


b 
[fesa ا‎ f(x(0,y(tD,z(t)x(Ddt . (4.35) 


The line integral of f(x, y,z) along C with respect to y is 


b 
o 1 yy (4.36) 


The line integral of f(x, y,z) along C with respect to z is 
b 
fesa = ا‎ f(x(D,y(t),z(t))z2 (Ddt . (4.37) 
a 


Similar to the two-variable case, if (x,y,z) > 0 then the line integral Ja f(x,y,z2)ds can be 
thought of as the total area of the “picket fence” of height f(x, y,z) at each point along the 
curve C in RÎ. 

Vector fields in RF are defined in a similar fashion to those in R/, which allows us to define 
the line integral of a vector field along a curve in RF. 


Definition 4.6. For a vector field f(x, y,z) = P(x,y,z2)i1+ Q(x,y,z2)] + R(x,y,z)k and a curve C 


in R3 with a smooth parametrization x = x(t), y = y(t), z = z(t), a <= t < b, the line integral 
off along C is 


| sar > | Pes, nax + | 0«.».2dy + | Rens, ae (4.38) 
€ ( (E ( 
b 
= | f(x(D,y(tD,z2(t)-r (dt , (4.39) 


where r(t) = x(t)i+ y(t)]j + z(t)k is the position vector for points on C. 
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Similar to the two-variable case, if f(x, y,z) represents the force applied to an object at a 
point (x, y,z) then the line integral fo f‘ dr represents the work done by that force in moving 
the object along the curve C in RF. 

Some of the most important results we will need for line integrals in R3 are stated below 
without proof (the proofs are similar to their two-variable equivalents). 


Theorem 4.10. For a vector field f(x, y,z) = P(x,y,z2)1+ Q(x,y,z2)] + R(x,y,z)k and a curve 
C with a smooth parametrization x = x(t), y = y(t), z = z(t), a < t <= b and position vector 
r(t) = x(t)i+ y(t)]j + z(t)k, 


| tar = | tTas, (4.40) 
C C 


where T(t) = TFI 2 7 is the unit tangent vector to C at (x(t), y(t),z(t)). 


Theorem 4.11. (Chain Rule) If w = f(x,y,z) is a continuously differentiable function of 
x, y, and z, and x = x(t), y = y(t) and z = z(t) are differentiable functions of t, then w is a 
differentiable function of t, and 


dw _ dwdx dwdy dw dz 


e 4.41 
TSG STG TD IU 0 


Also, if x = x(tı,t»), yY = y(tı,t,) and z = z(t,,t,) are continuously differentiable function of 
(f1,2), then 

E (4.492) 
dt, Ox Ot, dy tı dz Qt, 


and 
dw _ dw Ox dw Oy dw dz 


)4.43( + + ك 
dts dx dtsa dy dts dz Ota‏ 


Theorem 4.12. Let f(x,y,z2) = P(x,y,z2)i1+ Q(x,y,2)] + R(x,y,z)k be a vector field in some 
solid S, with P, Q and R continuously differentiable functions on S. Let C be a smooth 
curve in S parametrized by x = x(t), y = y(t), z = z(t), a < t = b. Suppose that there is a 
real-valued function F(x, y,z) such that VF = fon S. Then 


1 fdr = F(B) - F(A), (4.44) 
C 


where A = (x(a), y(a),z(a)) and B = (x(b), y(b),z(b)) are the endpoints of C. 


6See TAYLOR and MANN, § 6.5 for a proof. 
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Example 4.12. Let f (x,y,z) = z and let C be the curve in R3 parametrized by 
x=tsint, y=tcost, zZ=t, Osts8r. 


Evaluate fo f(x,y,2)ds. (Note: C is called a conical helix. See Figure 4.5.1). 


Solution: Since x(t) = sint + tcost, y(t) = cost —tsint, and z(t) = 1, we have 


x(t) + y1) + z'() = (sin t + 2tsintcost +t cost) + (cost — 2tsintcost +t sin t) + 1 
= (sin t + cost) + sin t + cost +1 


= +2, 


so since f(x(t), y(t),z(t)) = z(t) = t along the curve C, then 


8r 
1 ES FCD, y(D, 2(8) (D2 + y"(E)2 + 2(2 dt 
0 
8r 
= tVt2+2 dt 
0 


8r 1 
3 (64 + 2)** - 22) 1 


12( _ 
(2 + 5( ک 


Figure 4.5.1 Conical helix C 


Example 4.13. Let f(t,y,2) = xi+ yj + 2z k be a vector field in R3. Using the same curve C 
from Example 4.12, evaluate fo f‘dr. 


2 
Solution: It is easy to see that F(x,y,z) = + ج‎ +z is a potential for f(x, y,z) (i.e. VF = £. 
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So by Theorem 4.12 we know that 


[ta = F(B) - F(A) , where A = (x(0), y(0),z(0)) and B = (x(8r), y(87),z2(87)), so 


= F(8rsin8r, 8r cos8r,8r) ¬ F(OsinO0, Ocos0,0) 
F(0,8r,8r) ¬ F(0,0,0) 


8 2 
0+ + e -)0+0+00( = 967? . 


1 


We will now discuss a generalization of Green’s Theorem in R® to orientable surfaces in 
RF, called Stokes’ Theorem. A surface X in RF is orientable if there is a continuous vector 
field N in RF such that N is nonzero and normal to Z (i.e. perpendicular to the tangent plane) 
at each point of Z. We say that such an N is a normal vector field. 


For example, the unit sphere x+y+z% = 1 is orientable, since the 
continuous vector field N(x, y,z2) = xi+ yj +z k is nonzero and normal 
to the sphere at each point. In fact, ~-N(x,y,z) is another normal 
vector field (see Figure 4.5.2). We see in this case that N(x, y,z) is 
what we have called an outward normal vector, and N(x, y,z) is an 
inward normal vector. These “outward” and “inward” normal vec- 
tor fields on the sphere correspond to an “outer” and “inner” side, 
respectively, of the sphere. That is, we say that the sphere is a two- 
sided surface. Roughly, “two-sided” means “orientable”. Other ex- Figure 4.5.2 
amples of two-sided, and hence orientable, surfaces are cylinders, 
paraboloids, ellipsoids, and planes. 

You may be wondering what kind of surface would not have two sides. An example is the 
Möbius strip, which is constructed by taking a thin rectangle and connecting its ends at 
the opposite corners, resulting in a “twisted” strip (see Figure 4.5.3). 


(a) Connect A to A and B to B along the ends (b) Not orientable 
Figure 4.5.3 Möbius strip 
If you imagine walking along a line down the center of the Möbius strip, as in Figure 


4.5.3(b), then you arrive back at the same place from which you started but upside down! 
That is, your orientation changed even though your motion was continuous along that center 
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line. Informally, thinking of your vertical direction as a normal vector field along the strip, 
there is a discontinuity at your starting point (and, in fact, at every point) since your vertical 
direction takes two different values there. The Möbius strip has only one side, and hence is 
nonorientable." 

For an orientable surface Z which has a boundary curve C, pick a unit normal vector n 
such that if you walked along C with your head pointing in the direction of n, then the 
surface would be on your left. We say in this situation that n is a positive unit normal vector 
and that C is traversed r-positively. We can now state Stokes’ Theorem: 


Theorem 4.14. (Stokes’ Theorem) Let X be an orientable surface in R whose boundary 
is a simple closed curve C, and let f(x,y,z) = P(x,y,z)i+ Q(x, y,z)j + R(x, y,z)k be a smooth 
vector field defined on some subset of RF that contains XZ. Then 


fdr = || «eur f)-ndo, (4.45) 
2 
د‎ oR o oP oR 0Q oP 
E 5-Ji + (5-5 + (5-5) (4.46) 
Qy dz 0z Ûx dx ûy 


n is a positive unit normal vector over Z, and C is traversed n-positively. 


Proof: As the general case is beyond the scope of this text, we will prove the theorem only 
for the special case where Z is the graph of z = z(x, y) for some smooth real-valued function 
z(%,y), with (x, y) varying over a region D in RÎ. 


Projecting XZ onto the xy-plane, we see that the closed ٤ 


curve C (the boundary curve of Z) projects onto a closed . 
curve Cp which is the boundary curve of D (see Fig- 
ure 4.5.4). Assuming that C has a smooth parametriza- 
tion, its projection Cp in the xy-plane also has a smooth 


parametrization, say 


Cp:ix=xD0,y=yD,astsb, 
XX 


and so C can be parametrized (in R3) as CD 


Z2:z2=2(%,7) 


Cis x(t) لر‎ = y(t) 2ر‎ = z(x(t), y(t)) , ad“ t<b Figure 4.5.4 


since the curve C is part of the surface z = z(x,y). Now, by the Chain Rule (Theorem 4.4 in 
Section 4.2), for z = z(x(t), y(t)) as a function of t, we know that 


z(t) = e + n 


dx dy 


"For further discussion of orientability, see O’ NEILL, §IV.7. 
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and so 
{tar > | Pes,Das+ Qs, Day +R, 2da 
- (P+ qy + RF [E 1(0 + 0)) 
-[ ((P +R) x'+ (@ +R (0) dt 
a 0x dy 
اك‎ P(x,y)dxs+Q(x,yY)dy , 
where 


(x,y) = P(x,y,z(x,y)) + R(x,y,2(x, کد‎ (, Y) „and 


Q(x,y) = Q(x,y,2(t,7y)) + R(x,Y,2(%, »س‎ 3 7( 


for (x,y) in D. Thus, by Green’s Theorem applied to the region D, we have 


fer = Ie) 4 (4.47) 


so by the Product Rule we get‏ )» 3 س  (@, y,2(x,Y))+ R(x,Y,2(%,‏ را 
yT e Y,2(%, (2 —(x,Y)+ R(x, Y,2(%, a | 0 ( :‏ = 


Now, by formula (4.42) in Theorem 4.11, we have 


0Q dx OQ oy OQ 0z 
0x Ox 0y 0x ` 0z ox 


0 0 OQ û 
= O 2 4 0 


0 
— )Q)<,y,2)%,y((( = 
dx 


dx dy 0z Ox 
_ 4Q 2 OQ dz 
x 0z Ox 
Similarly, 
8 OR OR dz 
—(R ,2(4, = س إ4‎ 
4 e 0x Ûz ûx 
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Thus, 


0 _ 0Q 0Q öz (E,8 0z 

0x êx dz dx dx Oz Ox 
8 0Q ö oR û OR ûz ê 2ق‎ 

= 4Q ك‎ 0Q dz ك‎ OR dz 2 OR ûz dz E 

0x 0z Ox dx Oy dz dx dy Qxdy 


dz 0z 
+ R ,2)%,۷(( 
E xûy 


In a similar fashion, we can calculate 


O _ oP oP 0z OR dz OR ûz ûz 0z 


= ت ا ن ې‎ + : 
dy dy dz dy dy Ox 0z Oy Ox Qyûx 


So subtracting gives 


o oP _ 4 (£ P| 2 8 ٤ 0 
0x, dy (0z Oy] ox 0x dz] dy 0x ûy 1 
since 2 TE 2 3. by the smoothness of z = z(x,y). Hence, by equation (4.47), 
OR OQ) OP OR\0 0 OP 
fea OEE) o 
C dy dx \0z dx] dy 0x ûy 
after factoring out a —1 from the terms in the first two ا‎ in a (4.48). 
Now, recall from Section 2.3 (see p.76) that the vector N = j © 5 3 + k is normal to the 
tangent plane to the surface z = z(x,y) at each point of Z. hie 
N _-fi-gj+k 


INN 
0202 „ )9±( 
(1+ +) 
is in fact a positive unit normal vector to Z (see Figure 4.5.4). Hence, using the e 
tion r(x,y) = xi+ yj + z(x,y)k, for (x,y) in D, of the surface Z, we have gz =1+ ع‎ k and 


k, and so | E x E = > 1 + )( 4 +). So we see that using formula (4.46) for‏ ع =j+‏ ع 


curl f, we have 
|| eno: ‘ndo = (curl f)n jj x — 
dy 
2 (E-3 >) - (+-k) (i زک‎ j+k|d4 
dy dz 0z Ox dx ûy Ox êy 
EEE E-)a 
dy dx (0z dx] Qy dx ûy 


which, upon comparing to equation (4.49), proves the Theorem. QED 


8 
x 4 
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Note: The condition in Stokes’ Theorem that the surface Z have a (continuously vary- 
ing) positive unit normal vector n and a boundary curve C traversed n-positively can be 
expressed more precisely as follows: if r(t) is the position vector for C and T(t) = r'(t)/|lr'(t) || 
is the unit tangent vector to C, then the vectors T, n, T x n form a right-handed system. 

Also, it should be noted that Stokes’ Theorem holds even when the boundary curve C is 
piecewise smooth. 


Example 4.14. Verify Stokes’ Theorem for f(x,y,z) = zi+ xj + yk when Z is the paraboloid 
z = x + y such that z < 1 (see Figure 4.5.5). 


Solution: The positive unit normal vector to the surface 
z= 2),y( = x2 + ر‎ ¡8 


_ 1 اا لر‎ _ -21-2yز+k‎ 
2 2  V/1+4x2 +472 
(1+ +() 
and curl f= (1-0)i+(1—-0)j+(1—-0)k =1i+j + k, so 


(curl fn = (-2x-2y+1)1/ 1+ 4x2 +4 y2 . 


Figure 4.5.5 z= +y 


2 


Since X can be parametrized as r(x,y) = xi + yj + (x + y)k for 
(x,y) in the region D = {(x,y): x2 + y2 < 1}, then 


Or ûr 
ل س‎ 


dA 
dx Oy 


Jj ‘cur 9.nae 2 ا‎ (curl f)n 
3 D 


-2x- 2y +1 
= || ج‎ +2 +4 
“۷1+ 4x2 + ۶ر4‎ 
D 
2 ا‎ / (-2x-2y + 1)dA , so switching to polar coordinates gives 
D 


2r rl 
= (—-2rcosO —- 2r sinO + 1)r dr d0 
0 0 


2r 1م‎ 
= 1 (-2rcos@0 - 2r sin +r)dr d0 
0 0 


r=1 


|4 
r=0 


2 2r3 2r u. r2 
> cosټ‎ sinO + 
3 3 2 


2r 
(=f cos0 - f sin0 + 3) d0 


2 2 107 
= -ğsin0 + §cos0 + 30| = 
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The boundary curve C is the unit circle x% + y% = 1 laying in the plane z = 1 (see Figure 
4.5.5), which can be parametrized as x = cost, y = sint, z = 1 for O0 st < 2r. So 


2r 
{far ((1)(- sint) + (cost)(cost) + (sint)(0))dt 
C 0 


2r 1 2 1+cos2 
1 (- sint + | dt (here we used cost = ا‎ 
0 


2r 


4 tt ۳ sin 2t 
cost + — 
2 4 


0 


So we see that $, f‘ dr = ff (curl f)‘ndo, as predicted by Stokes’ Theorem. 
2 


The line integral in the preceding example was far simpler to calculate than the surface 
integral, but this will not always be the case. 


Example 4.15. Let > be the elliptic paraboloid z = + 2 for z < 1, and let C be its boundary 
curve. Calculate $f‘ dr for f(x, y,z) = (9xz + 2y)i + (2x + y2)j + (-2y + 22)k, where C is 
traversed counterclockwise. 

Solution: The surface is similar to the one in Example 4.14, except now the boundary curve C 
is the ellipse 8 = 1 laying in the plane z = 1. In this case, using Stokes’ Theorem is easier 
than computing the line integral directly. As in Example 4.14, at each point (x, y,z(x, y)) on 


2 
the surface z = z(x,y) = + ج‎ the vector 


_ 02_58 4_275 
ax ^ gyJj +k -gi- gj +k 


1+ +) e , 


is a positive unit normal vector to Z. And calculating the curl of f gives 


curl f = (-4y-0)i + (9x -0)j + (2-2)k = -4yi + 9x] + Ok, 


SO 
_ (-49(-$)+(9(-F)+(0(D  2xy-2xy +0 | 


EE ا‎ 
and so by Stokes’ Theorem 
{far ت‎ Jj ‘uri .nde > || oa. Û, 
C 
92 > 


(curl f)n 
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In physical applications, for a simple closed curve C the line integral $, fdr is often called 
the circulation of f around C. For example, if E represents the electrostatic field due to a 
point charge, then it turns out® that curl E = 0, which means that the circulation $ç E:dr =0 
by Stokes’ Theorem. Vector fields which have zero curl are often called irrotational fields. 

In fact, the term curl was created by the 19 century Scottish physicist James Clerk 
Maxwell in his study of electromagnetism, where it is used extensively. In physics, the 
curl is interpreted as a measure of circulation density. This is best seen by using another 
definition of curl f which is equivalent? to the definition given by formula (4.46). Namely, for 
a point (x,y,z) in RF, 


1 
n-‘(curl f)(<,y,2) = lim z9 fdr, (4.50) 
S¬0 S Jc 


where S is the surface area of a surface Z containing the point (x,y,z) and with a simple 
closed boundary curve C and positive unit normal vector n at (x,y,z). In the limit, think of 
the curve C shrinking to the point (x,y,z), which causes Z, the surface it bounds, to have 
smaller and smaller surface area. That ratio of circulation to surface area in the limit is 
what makes the curl a rough measure of circulation density (i.e. circulation per unit area). 


An idea of how the curl of a vector field is 1y 
related to rotation is shown in Figure 4.5.6. 2 
Suppose we have a vector field f(x, y,z) which 
is always parallel to the x<y-plane at each f 
point (x,y,z) and that the vectors grow larger 
the further the point (x,y,z) is from the y- 
axis. For example, f(r, y,z) = (1+ x2)j. Think 
of the vector field as representing the flow 
of water, and imagine dropping two wheels 
with paddles into that water flow, as in Fig- 
ure 4.5.6. Since the flow is stronger (i.e. the 
magnitude of f is larger) as you move away 0 ٣ 
from the y-axis, then such a wheel would ro- 
tate counterclockwise if it were dropped to Figure 4.5.6 Curl and rotation 
the right of the y-axis, and it would rotate 
clockwise if it were dropped to the left of the y-axis. In both cases the curl would be nonzero 
(curl f(x, y,z) = 2x k in our example) and would obey the right-hand rule, that is, curl f(x, y,z) 
points in the direction of your thumb as you cup your right hand in the direction of the rota- 
tion of the wheel. So the curl points outward (in the positive z-direction) if x > 0 and points 
inward (in the negative z-direction) if x < 0. Notice that if all the vectors had the same di- 
rection and the same magnitude, then the wheels would not rotate and hence there would 
be no curl (which is why such fields are called irrotational, meaning no rotation). 


SSee Ch. 2 in REITZ, MILFORD and CHRISTY. 
9See SCHEY, p. 78-81, for the derivation. 
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Finally, by Stokes’ Theorem, we know that if C is a simple closed curve in some solid 
region S in RF and if f(x, y,z) is a smooth vector field such that curl f = 0 in S, then 


{ far ّ Jj «cur .nao 2 |] o. nao 2 || odo = 0, 
cC > 3 > 


where Z is any orientable surface inside S whose boundary is C (such a surface is some- 
times called a capping surface for C). So similar to the two-variable case, we have a three- 
dimensional version of a result from Section 4.3, for solid regions in RF which are simply 
connected (i.e. regions having no holes): 


The following statements are equivalent for a simply connected solid region S in RF: 


(a) f(x,y,z2) = P(x,y,z2)1+ Q(x,y,z2)] + R(x, y,z2)k has a smooth potential F(x, y,z) in S 


(b) ا‎ fdr is independent of the path for any curve C in S 
0 


(c) fdr = 0 for every simple closed curve C in S 
( 


OR _0Q oP oR Q __oP 


> > : in S (i.e. If=O0Oin S 
ll an ET in S (i.e. cur. in S$) 


(d) 


Part (d) is also a way of saying that the differential form P dx +Q dy +R dz is exact. 


Example 4.16. Determine if the vector field f(x, y,2) = xyzi+xzj+xyk has a potential in RÎ. 


Solution: Since RF is simply connected, we just need to check whether curl f = 0 throughout 


RF, that is, 
OR __öQ OP __öR 4 OQ _oP 


an‏ , —-= س , = س 
dy dz 0z Ox dx Oy‏ 


throughout R3, where P(x,y,2)= xyz, Q(x,y,2) = xz, and R(x, y,z2) = xy. But we see that 


OP OR OP OR 8 
2 —=yلy س کج‎ £ -— for some (x,y,z) in RR. 
dz dx 


Thus, f(x, y,z) does not have a potential in RÎ. 


Exercises 


A 


For Exercises 1-3, calculate fo f(x,y,2)ds for the given function f(x, y,z) and curve C. 


1. f(x,y,2)=z2; C:x=cost, y=sint,2=t, Ost s2r 
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2. f(eyy,2)= + y +292; C:ix=t,y=t,z=1,1stS2 
3 


3. f(,y,2)= 22; C:x=tsint, y =tcost, 2 = 242,32, Osts1 
For Exercises 4-9, calculate J, f‘ dr for the given vector field f(x, y,z) and curve C. 
f(x, y,2)=1-J+k; C:x=3t,y=2t,2=t,Osts1l 


. f(x,y,2)= yi-xj+zak; C:x=cost, y=sint,z=t,Osts 2r 


4. 
5 
6. f(x,y,2)=xi+yj+zk; C:x=cost, yJ =sint,z2=2,O0st s2r 
1. f(x,y,2)= (y-22)i+xyj+ (Axa +y)k; C:x=t,y=2t,2=t-1,0<t<1 

8. f(x,y,2)= yzi+xzj+xyk; C : the polygonal path from (0, 0, 0) to (1,0,0) to (1,2,0) 
9 


. f(x,y,2)= xyi+ (z2 -x)j + 2yzk; C : the polygonal path from (0,0,0) to (1,0,0) to (1,2,0) 
to (1,2,2) 


For Exercises 10-13, state whether or not the vector field f(x, y,z) has a potential in RF (you 
do not need to find the potential itself). 


10. f(x,y,2)= yi—-xj+ zk 11. f(x,y,2)=ai+ bj + ck (a, b, c constant) 


12. f(x,y,2) = (t+ y)i+ xj + 2k 13. f(x,y,2) = xyi— (x - y2 )j + y zk 


For Exercises 14-15, verify Stokes’ Theorem for the given vector field f(x, y,z) and surface Z. 
14. f(x,y,2)= 2yi-xj+zk; XZ: + y2 +2 =1, 2 >0 
15. f(x,y,2)=xyi+xzj+yzk; XI:z=x2+y,2 <1 


16. Construct a Möbius strip from a piece of paper, then draw a line down its center (like 
the dotted line in Figure 4.5.3(b)). Cut the Möbius strip along that center line completely 
around the strip. How many surfaces does this result in? How would you describe them? 
Are they orientable? 


17. Use Gnuplot (see Appendix C) to plot the Möbius strip parametrized as: 


r(u,u) = cosu(l1+ucos $)i+ sinu(1+vcos $)j +usin fk, Osu s2r, -3 >u 


NI 


C 


18. Let Z> be a closed surface and f(x, y,z) a smooth vector field. Show that 
Jf (curl f)-ndo = 0. (Hint: Split XZ in half) 
3 


19. Show that Green’s Theorem is a special case of Stokes’ Theorem. 
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4.6 Gradient, Divergence, Curl and Laplacian 


In this final section we will establish some relationships between the gradient, divergence 
and curl, and we will also introduce a new quantity called the Laplacian. We will then show 
how to write these quantities in cylindrical and spherical coordinates. 

For a real-valued function f(r, y,z) on RF, the gradient Vf(x,y,2) is a vector-valued func- 
tion on RF, that is, its value at a point (x, y,z) is the vector 


Vf(x,y,2) = (5 1) کے‎ 0F; + f + oF 
dx dy dz Ox dy dz 
in RF, where each of the partial derivatives is evaluated at the point (x,y,z). So in this way, 
you can think of the symbol V as being “applied” to a real-valued function f to produce a 
vector Vf. 
It turns out that the divergence and curl can also be expressed in terms of the symbol V. 
This is done by thinking of V as a vector in R3, namely 


Sit jek (4.51) 


Here, the symbols 2, 2 and 2 are to be thought of as “partial derivative operators” that 
will get “applied” to a real-valued function, say f(x,y,z), to produce the partial derivatives 
ّ 3 and 2 For instance, 3, “applied” to f(x, y,z) produces . 

Is V really a vector? Strictly speaking, no, since 2, 2 and 2 are not actual numbers. But 
it helps to think of V as a vector, especially with the divergence and curl, as we will soon see. 
The process of “applying” 2, 2 to a real-valued function f(x, y,z) is normally thought of 
as multiplying the quantities: 


0 0 0 Of 0 Of 
(2)0. (0=. (= 
dx dx dy dy dz dz 

For this reason, V is often referred to as the “del operator”, since it “operates” on functions. 
For example, it is often convenient to write the divergence div f as V°f, since for a vector 


field f(x,y,z2) = fi(x,y,z2)1+ f2(x,y,2)] + f3(x,y,z)k, the dot product of f with V (thought of as a 
vector) makes sense: 


8 0 8 
Vf = | 1 + j + k] «(fk 9,2i + fa(x,y,2)] + fa(x,y,2)K) 
dx dy dz 


8 3 0 
د(2(‎ Ê د(5(‎ 4 2)» 
E 
dx dy dz 
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We can also write curl f in terms of V, namely as V x f, since for a vector field f(x, y,z) = 
P(x,y,z2)i+ Q(x, y,z2)] + R(x, y,z)k, we have: 


1 j k 

i 8 0 
xX > کے کے س‎ 
Ox dy dz 


P(x,y,z2) Q(x,y,z2) R(x,¥,2) 
84; 91; 
۲ 


dy dz dx dz dx ûy 
OR û OP oR 0 oP 

e E  E- 
dy dz 0z Ox dx ûy 

= curl f 


For a real-valued function f(x,y,z), the gradient Vf(x,y,2) = F4 j + Î k is a vector 


dz 
field, so we can take its divergence: 


div Vf = V-Vf 


0 6 0 0 0 
i+ —[ + kK) (GEi + 1; ٣ 
dx dy dz dx dy 
0 (0 
= ةة * (وة) وھ * () چ‎ 
dx \ 0x dy \Oy dz \ dz 
2ق‎ 2 2 
at 
0x2 رق‎ 22 


Note that this is a real-valued function, to which we will give a special name: 


Definition 4.7. For a real-valued function f(x, y,z), the Laplacian of f, denoted by Af, is 
given by 
ûf of of 


ACIZ) = WoW = 0 3 : 4.52 
e 0x2 2ر‎ 22 ( ) 


Often the notation Vf is used for the Laplacian instead of Af, using the convention V% = 
VV. 


Example 4.17. Let r(x, y,2) = xi+yj+z k be the position vector field on R3. Then [lr(x, y,z)|l2 = 
rr = 4 + y2 +z is a real-valued function. Find 


(a) the gradient of ri2 
(b) the divergence of r 
(c) the curl ofr 


(d) the Laplacian of rll 
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Solution: (a) V|lr|l = 2xi+ 2yj + 22k = 2r 
b) V-r= 2()+ 2 (y)+ (2= 1+1+1=8 
(c) 


FF E 
Fes A 2 Foi OD Qk = Û 
dx Ody dz 
Xx Yy z# 


(® All rll = 2 2 + رو + 2ے +( + ر + 2 +( + رو‎ + (= 2+2+2=6 
Note that we could have calculated A] r || another way, using the V notation along with parts 
(a) and (b): 
Alri = V-Vjlrll® = V-2r = 2V-F = 2(8) = 6 
Notice that in Example 4.17 if we take the curl of the gradient of || r|l we get 
Vx(Vjlrjl%) = Vx2r = 2Vxr = 20 =0. 
The following theorem shows that this will be the case in general: 


Theorem 4.15. For any smooth real-valued function f(x, y,z), V x (Vf) =0. 


Proof: We see by the smoothness of f that 


i j k 
0 0 0 
Vx(Vf) = |x dy Oz 
Of Of ûf 
0x Oy ûz 


3 e 5 1j 1| 
Qyd0z dzûy Qxd0z O0zûx Qxûy dyûx 2 


since the mixed partial derivatives in each component are equal. QED 


Another way of stating Theorem 4.15 is that gradients are irrotational. Also, notice that 
in Example 4.17 if we take the divergence of the curl of r we trivially get 


V-(Vxr) = V0 =0. 
The following theorem shows that this will be the case in general: 


Theorem 4.17. For any smooth vector field f(x, y,z), V*(V xf =0. 


The proof is straightforward and left as an exercise for the reader. 
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Proof: Let Z be a closed surface which bounds a solid S. The flux of V x f through Z is 


jie xf)da = jjj. xf) dV (by the Divergence Theorem) 
2 S 


O0OdV (by Theorem 4.17)‏ ا 
S‏ 


0. QED 


There is another method for proving Theorem 4.15 which can be useful, and is often used 
in physics. Namely, if the surface integral Jf f(x<,y,2)do = 0 for all surfaces X in some solid 
> 


region (usually all of RF), then we must have f(x, y,z) = 0 throughout that region. The proof 
is not trivial, and physicists do not usually bother to prove it. But the result is true, and can 
also be applied to double and triple integrals. 

For instance, to prove Theorem 4.15, assume that f(x, y,z) is a smooth real-valued func- 
tion on R3. Let C be a simple closed curve in RF and let XZ be any capping surface for C (i.e. 
Z> is orientable and its boundary is C). Since Vf is a vector field, then 


|j «0x6 -nde > Vf ‘dr by Stokes’ Theorem, so 
2 C 
= 0 by Corollary 4.13. 


Since the choice of Z was arbitrary, then we must have (V x (VF))-n = 0 throughout RF, where 
n is any unit vector. Using i, j and k in place of n, we see that we must have V x (Vf) = 0 in 
RF, which completes the proof. 


Example 4.18. A system of electric charges has a charge density p(x, y,z) and produces an 
electrostatic field E(x, y,z) at points (x,y,z) in space. Gauss’ Law states that 


|| Bae 4r || p dV 
P3 S 


for any closed surface Z which encloses the charges, with S being the solid region enclosed 
by XZ. Show that VE = 4rp. This is one of Maxwell’s Equations." 


10In Gaussian (or CGS) units. 
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Solution: By the Divergence Theorem, we have 


E‘doe‏ ا = dV‏ ل 


2 


Ar 1 / 0p dV by Gauss’ Law, so combining the integrals gives 
S 


0 , so 


i (V-E-4rp) dV 
S 


V-E-4rp = 0 since Z and hence S was arbitrary, so 
VE = 4rp. 


Often (especially in physics) it is convenient to use other coordinate systems when dealing 
with quantities such as the gradient, divergence, curl and Laplacian. We will present the 
formulas for these in cylindrical and spherical coordinates. 

Recall from Section 1.7 that a point (x,y,z) can be represented in cylindrical coordinates 
(r,0,z), where x = rcosO0, y= rsinO, z =z. At each point (r,0,z), let er, eg, ez be unit vectors 
in the direction of increasing r, 0, z, respectively (see Figure 4.6.1). Then er, eg, ez form an 
orthonormal set of vectors. Note, by the right-hand rule, that ez x er = eg. 


م€ 
eg‏ 2 

(%7,2) 
e4 

0 
9 0 
و 

x x Z0 


2 y (e, y,0( 


2 


Figure 4.6.1 Figure 4.6.2 
Orthonormal vectors er, eg, ez Orthonormal vectors ep, €g, ep 
in cylindrical coordinates in spherical coordinates 


Similarly, a point (x, y,z) can be represented in spherical coordinates (0,0,p), where x = 
psinpcos0, y = psinpsin0, z = pcosp. At each point (0,0,4), let ep, eg, ep be unit vectors 
in the direction of increasing 0, 0, p, respectively (see Figure 4.6.2). Then the vectors ep, eg, 
e4 are orthonormal. By the right-hand rule, we see that eg x e, = eq. 

We can now summarize the expressions for the gradient, divergence, curl and Laplacian 
in Cartesian, cylindrical and spherical coordinates in the following tables: 
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Cartesian (x,y,z): Scalar function F'; Vector field f= fi+ fj+ fk 
gradient: VF = — 
divergence: Vf = — 


curl: V xf = 


eS 


0F 
Laplacian: AF = + 


, 1۴ 
r 00 
10 
r 30 dz 
10 0 0 0 
curl: V x f (E e, + (Ee eg + 


gradient: VF = eg + — 


divergence: Vf = 
r 


r 30 dz dz Or 


Laplacian: AF 


êF oF‏ 1 ا 


+ + 
r Or r Or r2 802 0z 


Spherical (0,0,4): Scalar function F'; Vector field f= fo, eo + fg eg + fp ep 


mz > 2 + e 
psinp 00 p0 dp 
1 fo 0 
00 psinp Op 
1 


e 
1 0 
psinp 00 م‎ dp 
1 û ( „OF 1 0F 1 
2 0 4# 5 2 E 2: SIn 
p2 dp dp p2sin“ p 00 p2sin Op 


0 
gradient: VF = — 


divergence: V-f = 2 fo) + E 


7 00 
curl: V xf = 


Laplacian: AF = 


The derivation of the above formulas for cylindrical and spherical coordinates is straight- 
forward but extremely tedious. The basic idea is to take the Cartesian equivalent of the 
quantity in question and to substitute into that formula using the appropriate coordinate 
transformation. As an example, we will derive the formula for the gradient in spherical 
coordinates. 
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Goal: Show that the gradient of a real-valued function F(0,0,p) in spherical coordinates is: 


vF OF 1 OF 1 OF 
= ۲e e e 
00 °  psinp 00 ° ° pop * 
Idea: In the Cartesian gradient formula VF(x, y,z) = i48 oy + 5 £ k, put the Cartesian ba- 
sis vectors i, j, k in terms of the spherical coordinate E vectors €), eg, ep and functions of 
0, 0 and $. Then put the partial derivatives E 9 5 1 9E in terms of 9E 0 9F 1 5 and functions 


of p0, 0 and Q. 


Step 1: Get formulas for e, eg, ep in terms of i, j, k. 

We can see from Figure 4.6.2 that the unit vector e, in the p direction at a general point 
(0,0,@) is ep = TT where r = xi+ yj + zk is the position vector of the point in Cartesian 
coordinates. Thus, 


e س‎ = 
0 
rll V2 + y2 +22 


so using x = psinpcosO0, y = psinpsinO, z = pcosp, and p = Vx + y2 +z, we get: 


r xi+yj+zk 


ep = sinp cos0i + sinp sinO0j + cospk 


Now, since the angle 0 is measured in the x<y-plane, then the unit vector eg in the 0 
direction must be parallel to the xy-plane. That is, eg is of the form ai + bj + 0k. To figure 
out what a and b are, note that since eg L ep, then in particular eg L e, when e, is in the 
xy-plane. That occurs when the angle Q is xr/2. Putting Q = r/2 into the formula for ep gives 
e, = cos0i+sin0j +O0k, and we see that a vector perpendicular to that is ~ sin0i+cos0j + 0k. 
Since this vector is also a unit vector and points in the (positive) 0 direction, it must be eg: 


eg = -sinOi + cos0j + Ok 


Lastly, since ep = eg x ep, we get: 


ep = cospcos0i + cosp sinO0j ¬ sinpk 


Step 2: Use the three formulas from Step 1 to solve for i, j, k in terms of ep, eg, eq. 

This comes down to solving a system of three equations in three unknowns. There are 
many ways of doing this, but we will do it by combining the formulas for e, and ey to 
eliminate k, which will give us an equation involving just i and j. This, with the formula for 
eg, will then leave us with a system of two equations in two unknowns (i and j), which we 
will use to solve first for j then for i. Lastly, we will solve for k. 

First, note that 

sinpe, + cospep = cos0i + sin0j 
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so that 

sin0 (sinpep, + cospey) + cos0eg = (sin 0 + cos0)j = j, 
and so: 
j = sinp sinOep + cos0eg + cosQ sin ep 


Likewise, we see that 
cos0(sinpe, + cospep) — sindeg = (cos 0 + sin 0)i 2 


and so: 
i = sinp cose, — sindeg + cosp cos0 ep 


Lastly, we see that: 
k = cospep ¬ sinpep 


OF OF oF; OF OF oF 
Step 3: Get formulas for 5 0 00° o iP terms of 5r, 07° 02° 


By the Chain Rule, we have 
OF OE OE OF OF E 
00 dx 0p ûy ûp dz do 
OF _ OF x , OF Oy , OF e 
00 0x 00 ûy 00 dûz 0 
OF EO OR OY OE 
0p Ox Op dy Op Oz dp 


which yields: 
EEL + sin i + i 
dp dx dy 0 
OF : . 0F : OF 
a —psinp I + psinQp e 
OF OF . „OF . OF 
3 = PSOE + ENS = pa 
Step 4: Use the three formulas from Step 3 to solve for 9E, و‎ 9E in terms of 9F, 


Again, this involves solving a system of three equations in three unknowns. Using a 
similar process of elimination as in Step 2, we get: 


oF 1 8 OF . OF OF 
س‎ = p sin“ p cos — -—- sinO0 — + sin cos cosO0 — 
dx psinQp dp 30 Op 
OF 1 . 9, . 0F OF : . „OF 
ا‎ : p sin” p sinO0 — + cosO0 — + sinQ cosp sing — 
dy psinp dp 00 dp 
OF 2 0 . 
— = = | pcosp— - sinp — 
ET 0% 
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Step 5: Substitute the formulas for i, j, k from Step 2 the کک‎ for 9E, 5 E from 


Step 4 into the Cartesian gradient formula VF (x,y,z) = 9E 1+ j j + 5 Fk. 


Doing this last step is perhaps the most tedious, since it involves simplifying 3 x 3+ 3 x 3+ 
2 x 2 = 22 terms! Namely, 


VF = : (o sin ¢ cos = sin0 O + sing cos c00 5; | (sin cos0 e, = sin0 e; 
psinp dp 00 Op 
+ cos cos0 ep) 
+ م‎ sin p sinê 2 + cos + sin Q cos ¢ sind 7) (sin Q sinO e, + cos0 eg 
psinp dp 00 l0) 


+ cosQ sinO ep) 
1 OF OF 1 

+ N ET 34 (cospe, - sinpeg) , 
0 


which we see has 8 terms involving e, 6 terms involving eg, and 8 terms involving ey. But 
the algebra is straightforward and yields the desired result: 
OF 1 OF 1 OF 


VF = + + 
00 °° ` psinp 00 °° ° pop ® 


Example 4.19. In Example 4.17 we showed that Vjlr|l2 = 2r and All r || 2 = 6, where r(x,y,2) = 
xi+ yj + zk in Cartesian coordinates. Verify that we get the same answers if we switch to 
spherical coordinates. 

Solution: Since ||lr||2 = x + م = 2ے + ر‎ in spherical coordinates, let F(0,0,p) = م‎ (so that 
F(p0,0,p) = Ir|l2). The gradient of F in spherical coordinates is 


OF 1 OF 1 OF 


VF = — — س‎ 
GG #F DEIRD D0 ~00 
1 1 
=2 0 - (0 
pep + eT Jeg + 0 ep 
= 2peم‎ = م2‎ , as we showed earlier, so 
ج‎ 2p = 2r, as expected. And the Laplacian is 
1 O ( „OF 1 0F 1 d0(. OF 
AF 0 + 5 RE O sin p — 
0p 0p dp p2sin“ p 00 p*sinp Op 0p 
1 © © 1 1 8 
= س ج‎ )p 2p( + 0) + 1 0 
2م‎ 7 0 TT p2sinp OQ (sin (0)) 
1 0 
= > 2p) + 0 + 0 
pٌ* dp 


1 
= > (م6)‎ = 6, as expected. 
0 
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Exercises 
A 
For Exercises 1-6, find the Laplacian of the function f(x, y,z) in Cartesian coordinates. 
1. f(x,y,2)=x+y+z 2. f),y,2(= ° 3. ۶), 2ے + 2ر + 2) = (ع ,ر‎ 
4. f(x,y,2)= e 5. f),ر,2(‎ = ×3 + 3ے + ر‎ 6. f(uyy,2)= e 9# 


1. Find the Laplacian of the function in Exercise 3 in spherical coordinates. 
8. Find the Laplacian of the function in Exercise 6 in spherical coordinates. 


z2 
9. Let f(x,y,2)= e in Cartesian coordinates. Find Vf in cylindrical coordinates. 
TY 


10. For f(r,0,z2) = rer, +z sinOQ eg + rz e, in cylindrical coordinates, find div f and curl f. 


11. For f(0,0,p) = ep + p cos0 eg + peq in spherical coordinates, find div f and curl f. 
B 


For Exercises 12-23, prove the given formula (r = |lr || is the length of the position vector field 


r(x,y,2)= xi+ yj+zk). 


12. V(1/r)= -r/r3 18. A(1/r)=0 14. V-(r/r3)=0 15. V(lnr) = r/r* 
16. div(F+G) = div F + div G 17. curl(F +G) = curl F + curl G 

18. div(fF) = fdivF + F‘Vf 19. div(F x G) = G‘curl F - F‘curl G 
20. div(Vf x Vg) = 0 21. curl(fF) = fcurl F + (Vf)xF 
22. curl(curl F) = V(div F) - AF 23. A(fg) = fAg + gAf + 2(Vf°Vg) 
C 


24. Prove Theorem 4.17. 


25. Derive the gradient formula in cylindrical coordinates: VF = 9E er + 1 9F eg + 9E e, 


26. Use f= uVu in the Divergence Theorem to prove: 
(a) Green's first identity: Jf (u Av + (Vu)-(Vu)dV = ff (u Vu)*da 
S 2 


(b) Green's second identity: JJJ (u Au - vuAu)dV = ff(uVu — uVu)‘da 
S 2 


27. Suppose that Au = 0 (i.e. u is harmonic) over R3. Define the normal derivative ع‎ of u 


over a closed surface Z with outward unit normal vector n by 9 = Du =n ‘Vu. Show 


that Jf o do =0. (Hint: Use Green’s second identity.) 
2 
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Appendix A 


Answers and Hints to Selected Exercises 


yJy=1,2=3+t (c©0(x-2=2-8,y=1 
5.x=1+2t, y= -2 +t, 2 = -3 +81 
1.1.65 9. )1,2,3( 

11. 4x-4y+3z2-10=0 
13.x<-2y-z2+2=0 

15. 11x -24y + 212-26 =0 
19.x<=5t, y=2+3t, 2= —t 


17. 9/۷/35 
21. )10,-2,1( 


Section 1.6 (p. 46) 


1. radius: 1, center: (2,3,5) 3. radius: 5, 
center: (-1,—-1,—-1) 5. No intersection. 

1. circle x% + y% = 4 in the planes z = +7۷5 

9. lines * = +, 2 =0 and &=-*,z=0 


13. (E و‎ .,0( 


Section 1.7 (p. 50) 


1. (a) (4,‡,-1) (b) (V17,‡,1.816( 

3. (a) (2V7, 3Z,0) (b) (2/7, 3, 3) 

5. (ar? +2 =25 (b)p=5 

71. (a) r +92 = 36 (b) 0(1 + 8cos p) = 36 
10. (a,0,acotp) 12. Hint: Use the distance 
formula for Cartesian coordinates. 


Section 1.8 (Pp. 57) 


1. f(0) = (1,2,32); x =1+t,y=2 =1 

3. f(t) = (-2sin2t,2cos2t,1);j x =1, 
yJy=2t,z=t 5. v(t) =(1,1—cost,sint), 

a(t) = (0,sint, cost) 

9. (a) Line parallel to e (b) Half-line paral- 
lel to e (c) Hint: Think of the 
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Chapter 1 

Section 1.1 (p. 8) 

1. (a) V5 (b) V5 (© vV17 (@1 
(e) 2۷17 2. Yes 3. No 


Section 1.2 (p. 14) 


1. (a) (- 2 -3( )b( )2,6,-1( 

)( رم کا ,ت ,ی])‎ 
(D (14,-6,8) (g) (-7,3,-4) 
(h) (-1,-6,1) (DD (-2,—-4,2) 
3. No. llvll + llwll| is larger. 


() No. 


Section 1.3 (p. 18) 


1.10 3.783.4 5.90° 7.0 

9. Yes, since vw =0. 

11. |v-w| =0 < 2175 = lvl lw 

13. lv + wll = V26 < 21+ ۷5 = |v || + lw 
15. Hint: use Definition 1.6. 

24. Hint: See Theorem 1.10(c). 


Section 1.4 (p. 29) 


1.(-5,-23,-24) 3. )8,4,-5() 5.0 
7.16.72 9.4۷5 11.9 13.0 
and (8,-10,2) 15. 14 


Section 1.5 (p. 39) 


1. (a) (2,3,-2)+1(5,4,-3) (hb) x= 2+5t, 
y=3+4t,2 = -2- 8 TT 
3. (a) (2,1,3)+(1,0,) (bx = 2+t 
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functions as position vectors. E - 3) + y+ 42 
. . 2 
15. Hint: Theorem 1.16 21. £ y2, 2 0 2 - ,ام2‎ 
2ق‎ 


ty =1 ا‎ +1 23. E = 12, 
2 
£ =0, =0 25. =2 


Section 1.9 (p. 63) 


ax2 9 


° xû 
1. Z5 3. 2(5%*2-8) 5. Replace و م‎ 
2 2 =0 
2/3 gy YJ ° xûy 
iby ((;18) -4)/9 6. Hint: Use 
Theorem 1.20(e), Example 1.37, and Section 2.3 (Pp. 77) 


Theorem 1.16 7. Hint: Use Exercise 6. 

9. Hint: Use f() = [f(OIT, differentiate I 2+ 3Y 77-850 3. 2r+ty-2-2=0 
that to get f”(t), put those expressions into 5. x+2y=z 1. 3(-1)+ g(y > 2) + > z 
f(t) x f(t), then write T'(t) in terms of 211) = 0 9. 3x+4y- 52 =0 

N. 11. T(E) = Z(-sint,cost, 1), N() = 

(-cost,~sint,0), B(t) = 3(sint, = cost, 1), Section 2.4 (p. 82) 


)سلج ,سج ) .3 )24,2y(‏ .1 1/2 = )(« 


5. (1/x,1/y) 7. (YZ COS(XYZ),X2 COS(XYZ),XY COS(XYZ)) 


Chapter 2 9. (2,2,22) 11. 2⁄2 18. 
15. V3 cos(1) 17. increase: (45,20), 
Section 2.1 (p. 70) decrease: (-45,-20) 


1. domain: R, range: [-1,00) 3. domain: 


ES Ey A TDETOSS Section 2.5 (p. 88) 


5. domain: RP, range: [-1,1] 7.1 1. local min. (1,0); saddle pt. (1,0) 

9. does not exist 11.2 13.2 15.0 3. local min. (1,1); local max. (-1,-1); sad- 

17. does not exist dle pts. (1,-1),(-1,1) 5. local min. (1,1); 
saddle pt. (0,0) 17. local min. (0,0) 

Section 2.2 (p. 74) 9. local min. (-1,1/2) 11. width = height = 


depth=10 13.x=y=4,z2=2‏ د د 
2x, = =2y 3. 9F = «(2 + y+ 4)12,‏ = 2 .1 


f > = 0 E 12 ,ر + 7ر = أ ,چ‎ £ - Section 2.6 (p. 95) 


;(0.3998-,0.2858( = : )0,0( = )%0,0( .2 0= £ ,4 = £ .7 کک 
ER (xo, yo) = (1,1): = (1.03256, -1.94037(‏ 8 + 2( = $ .9 
y+ 4) 2,‏ + 3(2 = 91 11 
y +423 18. £ = 2e, Section 2.7 (p. 100)‏ + د 
e 15. 9f = ycos(%y), 1. min. (34, 2); max ((‏ 
( د( xcos(xy) 117. 3£ =9, f =2, 3. min (E, A); max‏ 1 
4min. (2,0, 2); max. (2,8, 7)‏ ,4+7+4 += .19 0= 2 
a +y +4) 3, 5‏ 
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Section 4.1 (p. 142) 


1.1/2 83.23 5. 24r 71. -2r 9. 2r 
11. 0 


Section 4.2 (p. 149) 


1.0 3.No 4. Yes. F(x,y)= . 
5. No 9. (b) No. Hint: Think of how F is 
defined. 10. Yes. F(x,y)=axy+bx+cy+d 


Section 4.3 (p. 155) 


1. 16/15 3. -5r 5. Yes. F(x,y) = xy? + x3 
1. Yes. F(x, y) = 4x y + 2y + 3% 


Section 4.4 (p. 163) 
1. 216r 2.3 3.12/5 7. 15/4 


Section 4.5 (p. 175) 


1. 221 2. (17۷/17-57۷5)/3 3. 25 

4.2 5.2r(r-1) "71.67/15 9.6 

11. Yes 13. No 19. Hint: Think of how 
a vector field f(x,y) = P(x, y)i+ Q(x, y)j in R 
can be extended in a natural way to be a vec- 
tor field in RÎ. 


Section 4.6 (p. 186) 
1.0 3.12/2 + 2ر‎ +22 5. 6+ y+ 2( 


7.120 8. (4p? -6)e* 9.-#e,+ ke, 
11. div f= ج‎ - . + cot; 

curl f= cot cos0 ep + 2eg —- 2cos0 ep 

25. Hint: Start by showing that er = cos0i+ 


sin0], eg = -sin0i+cos0j, ez =k. 


Chapter 3 


Section 3.1 (p. 104) 
1.1 3.34 5.5 7.5 9.31 11.5 


Section 3.2 (p. 109) 


1.1 3. 8ln2-3 5. 
10. $ 


سإ* 
a‏ 
N‏ 
CC‏ 

سا 


1 6. 


Section 3.3 (p. 112) 


1. $ 38. (2cos(7)+ r — 2)/4 
10. 


1.6 


9 
o 


Section 3.4 (p. 116) 


1. The values should converge to x 1.318. 
(Hint: In Java the exponential function e* 
can be obtained with Math.exp(x). Other 
languages have similar functions, otherwise 
use e = 2.7182818284590455 in your pro- 
gram.) 

2. x×1.146 3. = 0.705 4. × 0.168 


Section 3.5 (p. 123) 
1.87 3. (8-33) 7.1-2 9. ab 


Section 3.6 (p. 127) 
1. (1,8/3) 3. (0,4) 5. (0,37/16) 
7. (0,0,50/12) 9. (7/12,7/12,7/12( 
Section 3.7 (p. 134) 


1. yr 2.1 6. Both are 1 


n 
(n+1)2(n+2) ‘n 


Appendix B 


We will prove the right-hand rule for the cross product of two vectors in RF. 


For any vectors v and w in RF, define a new vector, n(v, w), as follows: 
9 3 


1. Ifv and w are nonzero and not parallel, and 0 is the angle between them, then n(v, w) is 
the vector in RÛ such that: 


(a) the magnitude of n(v, w) is ll vill ll wil sinO, 
(b) n(v,w) is perpendicular to the plane containing v and w, and 


(c) v, w, n(v,w) form a right-handed system. 
2. Ifv and w are nonzero and parallel, then n(v, w) = 0. 
3. If either v or w is 0, then n(v,w) =0. 


The goal is to show that n(v, w) = v x w for all v, w in R3, which would prove the right-hand 
rule for the cross product (by part 1(c) of our definition). To do this, we will perform the 
following steps: 


Step 1: Show that n(v,w) = v xw if v and w are any two of the basis vectors i, j, k. 


This was already shown in Example 1.11 in Section 1.4. 


Step 2: Show that n(av, bw) = ab(v x w) for any scalars a, b if v and w are any two of the 
basis vectors i, j, k. 
If either a = 0 or b = 0 then n(av, bw) = 0 = ab(v x w), so the result holds. So assume that 
a#O0and b #0. Let v and w be any two of the basis vectors i, j, k. For example, we will show 
that the result holds for v =i and w = k (the other possibilities follow in a similar fashion). 
For av = ai and bw = bk, the angle 0 between av and bw is 90°. Hence the magnitude 
of n(av,bw), by definition, is |laill || bkl| sin 90° = |ab|. Also, by definition, n(av, bw) is per- 
pendicular to the plane containing ai and bk, namely, the xz-plane. Thus, n(av, bw) must 
be a scalar multiple of j. Since its magnitude is |ab|, then n(av, bw) must be either |ab |j or 
-|lablj. 
There are four possibilities for the combinations of signs for a and b. We will consider the 
case when a > 0 and b > 0 (the other three possibilities are handled similarly). 
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In this case, n(av, bw) must be either abj or -abj. Now, since i, j, k form a right-handed 
system, then i, k, j form a left-handed system, and so i, k, —j form a right-handed system. 
Thus, ai, bk, -abj form a right-handed system (since a > 0, b > 0, and ab > 0). So since, by 
definition, ai, bk, n(ai, bk) form a right-handed system, and since n(ai, bk) has to be either 
abj or -abj, this means that we must have n(ai, bk) = —-abj. 

But we know that ai x bk = ab(i x k) = ab(-j) = -abj. Therefore, n(ai, bk) = ab(i x K), 
which is what we needed to show. 

'. n(av,bw)=ab(v xw) Vv 


Step 3: Show that n(u,v + w) = n(u, v) + n(u, w) for any vectors u, V, W. 


If u = 0 then the result holds trivially since n(u, v + w), n(u,v) and n(u,w) are all the zero 
vector. If v = 0, then the result follows easily since n(u,v + w) = n(u,0 + w) = n(u,w) = 
0+ n(u,w) = n(u,0) = n(u,w) = n(u,v)+ n(u,w). A similar argument shows that the result 
holds if w =0. 

So now assume that u, v and w are all nonzero vectors. We will describe a geometric 
construction of n(u, v), which is shown in the figure below. Let P be a plane perpendicular 
to u. Multiply the vector v by the positive scalar |l ull, then project the vector |l ull v straight 
down onto the plane P. You can think of this projection vector (denoted by projpllull v) as 
the shadow of the vector |lull v on the plane P, with the light source directly overhead the 
terminal point of [lull v. If 0 is the angle between u and v, then we see that projp ll ull v has 
magnitude [lull ll v Il sin@, which is the magnitude of n(u, v). So rotating projp ll ull v by 90° 
in a counter-clockwise direction in the plane P gives a vector whose magnitude is the same 
as that of n(u, v) and which is perpendicular to projp ll ull v (and hence perpendicular to v). 
Since this vector is in P then it is also perpendicular to u. And we can see that u, v and 
this vector form a right-handed system. Hence this vector must be n(u,v). Note that this 
holds even if u || v, since in that case 0 = 0° and so sinO = 0 which means that n(u, v) has 
magnitude 0, which is what we would expect. 


0 


projpllullv ` 


Nn(u, V) 


Now apply this same geometric construction to get n(u, w) and n(u, v + w). Since || ull (v + 
w) is the sum of the vectors |lull v and |l ull w, then the projection vector projp |l ull (v + w) is 
the sum of the projection vectors projp llull v and projp llull w (to see this, using the shadow 
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analogy again and the parallelogram rule for vector addition, think of how projecting a 
parallelogram onto a plane gives you a parallelogram in that plane). So then rotating all 
three projection vectors by 90° in a counter-clockwise direction in the plane P preserves that 
sum (see the figure below), which means that n(u, v + w) = n(u, v) + n(u,w). V 


lull v 


projpl\ ull (v + w) a> EFF 2 ¥ 


projp 1 ull w 


n(u, v) 


N(u, V+ Ww) 


Step 4: Show that n(w,v) = —n(v,w) for any vectors V, W. 


Ifv and w are nonzero and parallel, or if either is 0, then n(w, v) = 0 = —n(v,w), so the result 
holds. So assume that v and w are nonzero and not parallel. Then n(w,v) has magnitude 
lIwll llvll sin@, which is the same as the magnitude of n(v, w), and hence is the same as the 
magnitude of -n(v,w). By definition, n(v, w) is perpendicular to the plane containing w and 
v, and hence so is -n(v,w). Also, v, w, n(v,w) form a right-handed system, and so w, v, 
n(v,w) form a left-handed system, and hence w, v, -n(v,w) form a right-handed system. 
Thus, we have shown that —-n(v,w) is a vector with the same magnitude as n(w,v) and is 
perpendicular to the plane containing w and v, and that w, v, ~n(v, w) form a right-handed 
system. So by definition this means that —~n(v, w) must be n(w,v). 


Step 5: Show that n(v,w) = v x w for all vectors v, w. 


Write v =u,i+u,] +uşk and w = w,i+ wa] + wk. Then by Steps 3 and 4, we have 
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n(v,w) = n(vuبi+u,]j‎ +ugşk,wıi+ wa] + wsk) 
= n(vıi+u,]j+uşk,wıÙD) + n(vuı1+ u,] +usk, wa] + ws k) 
= n(vıi+u,j+usşk,wıÙD) + n(vıi+u,]j +usşk,wa]) + n(vuı1+u,] + us k,w; Kk) 


= -n(wرlvui+u,j+usşk)‎ + —-n(w»]j,uıi+ uj +usk) + —-n(wsk,u,i+ u] + us K). 


We can use Steps 1 and 2 to evaluate the three terms on the right side of the last equation 
above: 


-n(wرi,vıi+u,j+usşk)‎ = —-n(w,i,u,) + —-n(w,i,v,]) + —~n(w,i,ugk) 
= -uıwıN(i,i) + usw n(i,]) + ~vugw,رn(i,‎ k) 
= -uıwر(1‎ XÎ) + -vugwر(1‎ xX j) + —~vugwر(1‎ xX kK) 
= -uwس0‎ + -uوwس چ‎ + -uںوwس)-[(‎ 


-n(wıl,uıi+u,j+ ugk) = usw k + uswı] 
Similarly, we can calculate 
-n(wsj,Uıi+u,]j +uşk) = uwk — ugwai 
and 
-n(wsj,Uıi+u,]j +usşk) = uwa] + Uws1 . 
Thus, putting it all together, we have 


n(v,w) = -ugwık + usw] + uıwk — uşwai — Uıwaj + Uwai 


= (UwWs ~UgWa)i + (Uw ¬ U1 Wa)j + (Uw ~ uw )k 


vxw by definition of the cross product. 


`. N(v,w) =v xw for all vectors v, w. V7 


So since v, w, n(v,w) form a right-handed system, then v, w, v x w form a right-handed 
system, which completes the proof. 


Appendix C 


3D Graphing with Gnuplot 


Gnuplot is a free, open-source software package for producing a variety of graphs. Versions 


are available for many operating systems. Below is a very brief tutorial on how to use 
Gnuplot to graph functions of several variables. 


INSTALLATION 


Go to http://www.gnuplot.info/download. html and follow the links to download the lat- 
est version for your operating system. For Windows, you should get the Zip file with a 
name such as gp420win32. zip, which is version 4.2.0. All the examples we will discuss 
require at least version 4.2.0. 


. Install the downloaded file. For example, in Windows you would unzip the Zip file you 


downloaded in Step 1 into some folder (use the “Use folder names” option if extracting 
with WinZip). 


1 


RUNNING GNUPLOT 


In Windows, run wgnuplot.exe from the folder (or bin folder) where you installed Gnu- 
plot. In Linux, just type gnuplot in a terminal window. 


. You should now get a Gnuplot terminal with a gnuplot> command prompt. In Windows 


this will appear in a new window, while in Linux it will appear in the terminal window 
where the gnuplot command was run. For Windows, if the font is unreadable you can 
change it by right-clicking on the text part of the Gnuplot window and selecting the 
“Choose Font...” option. For example, the font “Courier”, style “Regular”, size “12” is 
usually a good choice (that choice can be saved for future sessions by right-clicking in the 
Gnuplot window again and selecting the option to update wgnuplot.ini). 


. At the gnuplot> command prompt you can now run graphing commands, which we will 


now describe. 


1. 


GRAPHING FUNCTIONS 


The usual way to create 3D graphs in Gnuplot is with the splot command: 


splot <range> <comma-separated list of functions> 
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For a function z = f(x,y), <range> is the range of x and y values (and optionally the range 
of z values) over which to plot. To specify an x range and a y range, use an expression of the 
form [a : bl[c :d|, for some numbers a < b and c < d. This will cause the graph to be plotted 
fora sxsbandcsysd. 


Function definitions use the x and y variables in combination with mathematical operators, 


listed below: 
Symbol Operation ) Example | Result 
+ Addition 2+3 5 
ت‎ Subtraction 3-2 1 
Multiplication 2*3 6 
/ Division 4/2 2 
2 Power 2**3 23 =8 
exp(x) e* exp(2) e 
log(x) Inx 1og(2) In2 
sin(x) Sinx sin(pi/2) 1 
cos(x) COSX cos(pi) -1 
tan(x) tanx tan(pi/4) 1 
Example C.1. To graph the function z = 2x% + y from x = -1 to x = 1 and from y = -2 to 


y =2, type this at the gnuplot> prompt: 
splot [-1:1][-2:2] 2xx**%2 + J**2 


The result is shown below: 


2X H2 Yy# #2 


© ۸ ج تن‎ O O 
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Note that we had to type 2*x**2 to multiply 2 times x. For clarity, parentheses can be used 
to make sure the operations are being performed in the correct order: 


splot [-1:1][-2:2] 2*(x**%2) + J**2 


In the above example, to also plot the function z = e** on the same graph, put a comma 
after the first function then append the new function: 


splot [-1:1][-2:2] 2*(x**%2) + y**%2, exp(x+y) 


By default, the x-axis and y-axis are not shown in the graph. To display the axes, use this 
command before the splot command: 


set zeroaxis 


Also, by default the x- and y-axes are switched from their usual position. To show the axes 
with the orientation which we have used throughout the text, use this command: 


set view 60,120,1,1 


Also, to label the axes, use these commands: 


set xlabel "x 


set ylabel "y 


set zlabel "z" 


To show the level curves of the surface z = f(x,y) on both the surface and projected onto the 
xy-plane, use this command: 
set contour both 


The default mesh size for the grid on the surface is 10 units. To get more of a colored/shaded 
surface, increase the mesh size (to, say, 25) like this: 


set isosamples 25 


Putting all this together, we get the following graph with these commands: 


set zeroaxis 

set view 60,120,1,1 

set xlabel "x" 

set ylabel "y" 

set zlabel "z" 

set contour both 

set isosamples 25 

splot [-1:1][-2:2] 2*(x**%2) + y**%2, exp(x+y) 
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2X H#2+Y# #2 


exp(x + y) 
20 
15 
س ےی‎ 10 


5 


PORES! 
EEE 
ETT 

کک کو و 


2 ےک 
کے کی کے 


The numbers listed below the functions in the key in the upper right corner of the graph 
are the “levels” of the level curves of the corresponding surface. That is, they are the num- 
bers c such that f(x,y) = c. Because of the large number of level curves, the key was put 
outside the graph with the set key outside command. If you do not want the function key 
displayed, it can be turned off with this command: unset key 


PARAMETRIC FUNCTIONS 


Gnuplot has the ability to graph surfaces given in various parametric forms. For example, 
for a surface parametrized in cylindrical coordinates 


x=rcos0, y=rsin0, z=zg 


you would do the following: 


set mapping cylindrical 
set parametric 


splot [a:b][c:d] vx*<cos(u),vx*sin(u),f(uU,V) 


where the variable u represents Û0, with a <= u = b, the variable u represents r, with c su sd, 
and z = f(u,v) is some function of u and u. 


Example C.2. The graph of the helicoid z = 0 in Example 1.34 from Section 1.7 (p. 49) was 
created using the following commands: 
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set mapping cylindrical 

set parametric 

set view 60,120,1,1 

set xyplane 0 

set xlabel "x" 

set ylabel "y" 

set zlabel "z" 

unset key 

set isosamples 15 

splot [0:4*pi] [0:2] vxcos(u),vx*sin(u),Uu 


The command set xyplane 0 moves the z-axis so that z = 0 aligns with the xy-plane (which 
is not the default in Gnuplot). Looking at the graph, you will see that r varies from 0 to 2, 
and 0 varies from 0 to 47. 


PRINTING AND SAVING 


In Windows, to print a graph from Gnuplot right-click on the titlebar of the graph’s window, 
select “Options” and then the “Print...” option. If that does not work on your version of 
Gnuplot, then go to the File menu on the main Gnuplot menubar, select “Output Device ...”, 
and enter pdf in the Terminal type? textfield, hit OK. That will allow you to print the graph 
as a PDF file. 

To save a graph, say, as a PNG file, go to the File menu on the main Gnuplot menubar, 
select “Output Device ...”, and enter png in the Terminal type? textfield, hit OK. Then, in the 
File menu again, select the “Output ...” option and enter a filename (say, graph.png) in the 
Output filename? textfield, hit OK. Now run your splot command again and you should see 
a file called graph.png in the current directory (usually the directory where wgnuplot.exe is 
located, though you can change that setting using the “Change Directory ...” option in the 
File menu). 


In Linux, to save the graph as a file called graph.png, you would issue the following com- 
mands: 


set terminal png 

set output ’graph.png’ 
and then run your splot command. There are many terminal types (which determine the 
output format). Run the command set terminal to see all the possible types. In Linux, 
the postscript terminal type is popular, since the print quality is high and there are many 
PostScript viewers available. 


To quit Gnuplot, type quit at the gnuplot> command prompt. 
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Version 1.2, November 2002 
Copyright ©2000,2001,2002 Free Software Foundation, Inc. 


51 Franklin St, Fifth Floor, Boston, MA 02110-1301 USA 


Everyone is permitted to copy and distribute verbatim copies of this license document, but 
changing it is not allowed. 


Preamble 


The purpose of this License is to make a manual, textbook, or other functional and useful 
document "free" in the sense of freedom: to assure everyone the effective freedom to copy 
and redistribute it, with or without modifying it, either commercially or noncommercially. 
Secondarily, this License preserves for the author and publisher a way to get credit for their 
work, while not being considered responsible for modifications made by others. 

This License is a kind of "copyleft", which means that derivative works of the document 
must themselves be free in the same sense. It complements the GNU General Public License, 
which is a copyleft license designed for free software. 

We have designed this License in order to use it for manuals for free software, because free 
software needs free documentation: a free program should come with manuals providing the 
same freedoms that the software does. But this License is not limited to software manuals; it 
can be used for any textual work, regardless of subject matter or whether it is published as a 
printed book. We recommend this License principally for works whose purpose is instruction 
or reference. 


1. APPLICABILITY AND DEFINITIONS 


This License applies to any manual or other work, in any medium, that contains a notice 
placed by the copyright holder saying it can be distributed under the terms of this License. 
Such a notice grants a world-wide, royalty-free license, unlimited in duration, to use that 
work under the conditions stated herein. The "Document", below, refers to any such manual 
or work. Any member of the public is a licensee, and is addressed as "you". You accept 
the license if you copy, modify or distribute the work in a way requiring permission under 
copyright law. 

A "Modified Version" of the Document means any work containing the Document or a 
portion of it, either copied verbatim, or with modifications and/or translated into another 
language. 
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A "Secondary Section" is a named appendix or a front-matter section of the Document 
that deals exclusively with the relationship of the publishers or authors of the Document 
to the Document’s overall subject (or to related matters) and contains nothing that could 
fall directly within that overall subject. (Thus, if the Document is in part a textbook of 
mathematics, a Secondary Section may not explain any mathematics.) The relationship 
could be a matter of historical connection with the subject or with related matters, or of 
legal, commercial, philosophical, ethical or political position regarding them. 

The "Invariant Sections" are certain Secondary Sections whose titles are designated, 
as being those of Invariant Sections, in the notice that says that the Document is released 
under this License. If a section does not fit the above definition of Secondary then it is not 
allowed to be designated as Invariant. The Document may contain zero Invariant Sections. 
If the Document does not identify any Invariant Sections then there are none. 

The "Cover Texts" are certain short passages of text that are listed, as Front-Cover Texts 
or Back-Cover Texts, in the notice that says that the Document is released under this Li- 
cense. A Front-Cover Text may be at most Š5 words, and a Back-Cover Text may be at most 
25 words. 

A "Transparent" copy of the Document means a machine-readable copy, represented in 
a format whose specification is available to the general public, that is suitable for revising 
the document straightforwardly with generic text editors or (for images composed of pixels) 
generic paint programs or (for drawings) some widely available drawing editor, and that 
is suitable for input to text formatters or for automatic translation to a variety of formats 
suitable for input to text formatters. A copy made in an otherwise Transparent file format 
whose markup, or absence of markup, has been arranged to thwart or discourage subsequent 
modification by readers is not Transparent. An image format is not Transparent if used for 
any substantial amount of text. A copy that is not "Transparent" is called "Opaque'. 

Examples of suitable formats for Transparent copies include plain ASCII without markup, 
Texinfo input format, LaTeX input format, SGML or XML using a publicly available DTD, 
and standard-conforming simple HTML, PostScript or PDF designed for human modifica- 
tion. Examples of transparent image formats include PNG, XCF and JPG. Opaque formats 
include proprietary formats that can be read and edited only by proprietary word proces- 
sors, SGML or XML for which the DTD and/or processing tools are not generally available, 
and the machine-generated HTML, PostScript or PDF produced by some word processors for 
output purposes only. 

The "Title Page" means, for a printed book, the title page itself, plus such following pages 
as are needed to hold, legibly, the material this License requires to appear in the title page. 
For works in formats which do not have any title page as such, "Title Page" means the text 
near the most prominent appearance of the work's title, preceding the beginning of the body 
of the text. 

A section "Entitled XYZ" means a named subunit of the Document whose title either 
is precisely XYZ or contains XYZ in parentheses following text that translates XYZ in an- 
other language. (Here XYZ stands for a specific section name mentioned below, such as 
"Acknowledgments", "Dedications", "Endorsements", or "History".) To "Preserve the 


203 


Title" of such a section when you modify the Document means that it remains a section 
"Entitled XYZ" according to this definition. 

The Document may include Warranty Disclaimers next to the notice which states that 
this License applies to the Document. These Warranty Disclaimers are considered to be 
included by reference in this License, but only as regards disclaiming warranties: any other 
implication that these Warranty Disclaimers may have is void and has no effect on the 
meaning of this License. 


2. VERBATIM COPYING 


You may copy and distribute the Document in any medium, either commercially or non- 
commercially, provided that this License, the copyright notices, and the license notice say- 
ing this License applies to the Document are reproduced in all copies, and that you add no 
other conditions whatsoever to those of this License. You may not use technical measures to 
obstruct or control the reading or further copying of the copies you make or distribute. How- 
ever, you may accept compensation in exchange for copies. If you distribute a large enough 
number of copies you must also follow the conditions in section 3. 

You may also lend copies, under the same conditions stated above, and you may publicly 
display copies. 


3. COPYING IN QUANTITY 


If you publish printed copies (or copies in media that commonly have printed covers) of 
the Document, numbering more than 100, and the Document'’s license notice requires Cover 
Texts, you must enclose the copies in covers that carry, clearly and legibly, all these Cover 
Texts: Front-Cover Texts on the front cover, and Back-Cover Texts on the back cover. Both 
covers must also clearly and legibly identify you as the publisher of these copies. The front 
cover must present the full title with all words of the title equally prominent and visible. 
You may add other material on the covers in addition. Copying with changes limited to the 
covers, as long as they preserve the title of the Document and satisfy these conditions, can 
be treated as verbatim copying in other respects. 

If the required texts for either cover are too voluminous to fit legibly, you should put the 
first ones listed (as many as fit reasonably) on the actual cover, and continue the rest onto 
adjacent pages. 

If you publish or distribute Opaque copies of the Document numbering more than 100, 
you must either include a machine-readable Transparent copy along with each Opaque copy, 
or state in or with each Opaque copy a computer-network location from which the general 
network-using public has access to download using public-standard network protocols a com- 
plete Transparent copy of the Document, free of added material. If you use the latter option, 
you must take reasonably prudent steps, when you begin distribution of Opaque copies in 
quantity, to ensure that this Transparent copy will remain thus accessible at the stated lo- 
cation until at least one year after the last time you distribute an Opaque copy (directly or 
through your agents or retailers) of that edition to the public. 
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It is requested, but not required, that you contact the authors of the Document well before 
redistributing any large number of copies, to give them a chance to provide you with an 
updated version of the Document. 


4. MODIFICATIONS 


You may copy and distribute a Modified Version of the Document under the conditions of 
sections 2 and 3 above, provided that you release the Modified Version under precisely this 
License, with the Modified Version filling the role of the Document, thus licensing distribu- 
tion and modification of the Modified Version to whoever possesses a copy of it. In addition, 
you must do these things in the Modified Version: 


A. Usein the Title Page (and on the covers, if any) a title distinct from that of the Document, 
and from those of previous versions (which should, if there were any, be listed in the 
History section of the Document). You may use the same title as a previous version if the 
original publisher of that version gives permission. 


B. List on the Title Page, as authors, one or more persons or entities responsible for au- 
thorship of the modifications in the Modified Version, together with at least five of the 
principal authors of the Document (all of its principal authors, if it has fewer than five), 
unless they release you from this requirement. 


C. State on the Title page the name of the publisher of the Modified Version, as the pub- 
lisher. 


D. Preserve all the copyright notices of the Document. 


E. Add an appropriate copyright notice for your modifications adjacent to the other copy- 
right notices. 


F. Include, immediately after the copyright notices, a license notice giving the public per- 
mission to use the Modified Version under the terms of this License, in the form shown 
in the Addendum below. 


G. Preserve in that license notice the full lists of Invariant Sections and required Cover 
Texts given in the Document’s license notice. 


H. Include an unaltered copy of this License. 


I. Preserve the section Entitled "History", Preserve its Title, and add to it an item stating 
at least the title, year, new authors, and publisher of the Modified Version as given on the 
Title Page. If there is no section Entitled "History" in the Document, create one stating 
the title, year, authors, and publisher of the Document as given on its Title Page, then 
add an item describing the Modified Version as stated in the previous sentence. 
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J. Preserve the network location, if any, given in the Document for public access to a Trans- 
parent copy of the Document, and likewise the network locations given in the Document 
for previous versions it was based on. These may be placed in the "History" section. You 
may omit a network location for a work that was published at least four years before the 
Document itself, or if the original publisher of the version it refers to gives permission. 


K. For any section Entitled "Acknowledgments" or "Dedications", Preserve the Title of the 
section, and preserve in the section all the substance and tone of each of the contributor 
acknowledgments and/or dedications given therein. 


L. Preserve all the Invariant Sections of the Document, unaltered in their text and in their 
titles. Section numbers or the equivalent are not considered part of the section titles. 


M. Delete any section Entitled "Endorsements". Such a section may not be included in the 
Modified Version. 


N. Do not retitle any existing section to be Entitled "Endorsements" or to conflict in title 
with any Invariant Section. 


O. Preserve any Warranty Disclaimers. 


If the Modified Version includes new front-matter sections or appendices that qualify as 
Secondary Sections and contain no material copied from the Document, you may at your 
option designate some or all of these sections as invariant. To do this, add their titles to 
the list of Invariant Sections in the Modified Version’s license notice. These titles must be 
distinct from any other section titles. 

You may add a section Entitled "Endorsements", provided it contains nothing but endorse- 
ments of your Modified Version by various parties—for example, statements of peer review 
or that the text has been approved by an organization as the authoritative definition of a 
standard. 

You may add a passage of up to five words as a Front-Cover Text, and a passage of up to 
25 words as a Back-Cover Text, to the end of the list of Cover Texts in the Modified Version. 
Only one passage of Front-Cover Text and one of Back-Cover Text may be added by (or 
through arrangements made by) any one entity. If the Document already includes a cover 
text for the same cover, previously added by you or by arrangement made by the same entity 
you are acting on behalf of, you may not add another; but you may replace the old one, on 
explicit permission from the previous publisher that added the old one. 

The author(s) and publisher(s) of the Document do not by this License give permission to 
use their names for publicity for or to assert or imply endorsement of any Modified Version. 


5. COMBINING DOCUMENTS 


You may combine the Document with other documents released under this License, under 
the terms defined in section 4 above for modified versions, provided that you include in the 
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combination all of the Invariant Sections of all of the original documents, unmodified, and 
list them all as Invariant Sections of your combined work in its license notice, and that you 
preserve all their Warranty Disclaimers. 

The combined work need only contain one copy of this License, and multiple identical In- 
variant Sections may be replaced with a single copy. If there are multiple Invariant Sections 
with the same name but different contents, make the title of each such section unique by 
adding at the end of it, in parentheses, the name of the original author or publisher of that 
section if known, or else a unique number. Make the same adjustment to the section titles 
in the list of Invariant Sections in the license notice of the combined work. 

In the combination, you must combine any sections Entitled "History" in the various orig- 
inal documents, forming one section Entitled "History"; likewise combine any sections En- 
titled "Acknowledgments", and any sections Entitled "Dedications". You must delete all 
sections Entitled "Endorsements". 


6. COLLECTIONS OF DOCUMENTS 


You may make a collection consisting of the Document and other documents released un- 
der this License, and replace the individual copies of this License in the various documents 
with a single copy that is included in the collection, provided that you follow the rules of this 
License for verbatim copying of each of the documents in all other respects. 

You may extract a single document from such a collection, and distribute it individually 
under this License, provided you insert a copy of this License into the extracted document, 
and follow this License in all other respects regarding verbatim copying of that document. 


1. AGGREGATION WITH INDEPENDENT WORKS 


A compilation of the Document or its derivatives with other separate and independent 
documents or works, in or on a volume of a storage or distribution medium, is called an "ag- 
gregate" if the copyright resulting from the compilation is not used to limit the legal rights 
of the compilation’s users beyond what the individual works permit. When the Document 
is included in an aggregate, this License does not apply to the other works in the aggregate 
which are not themselves derivative works of the Document. 

If the Cover Text requirement of section 3 is applicable to these copies of the Document, 
then if the Document is less than one half of the entire aggregate, the Document’s Cover 
Texts may be placed on covers that bracket the Document within the aggregate, or the elec- 
tronic equivalent of covers if the Document is in electronic form. Otherwise they must ap- 
pear on printed covers that bracket the whole aggregate. 


8. TRANSLATION 


Translation is considered a kind of modification, so you may distribute translations of the 
Document under the terms of section 4. Replacing Invariant Sections with translations re- 
quires special permission from their copyright holders, but you may include translations of 
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some or all Invariant Sections in addition to the original versions of these Invariant Sections. 
You may include a translation of this License, and all the license notices in the Document, 
and any Warranty Disclaimers, provided that you also include the original English version 
of this License and the original versions of those notices and disclaimers. In case of a dis- 
agreement between the translation and the original version of this License or a notice or 
disclaimer, the original version will prevail. 

If a section in the Document is Entitled "Acknowledgments", "Dedications", or "History", 
the requirement (section 4) to Preserve its Title (section 1) will typically require changing 
the actual title. 


9. TERMINATION 


You may not copy, modify, sublicense, or distribute the Document except as expressly pro- 
vided for under this License. Any other attempt to copy, modify, sublicense or distribute the 
Document is void, and will automatically terminate your rights under this License. How- 
ever, parties who have received copies, or rights, from you under this License will not have 
their licenses terminated so long as such parties remain in full compliance. 


10. FUTURE REVISIONS OF THIS LICENSE 


The Free Software Foundation may publish new, revised versions of the GNU Free Doc- 
umentation License from time to time. Such new versions will be similar in spirit to the 
present version, but may differ in detail to address new problems or concerns. See http:// 
www. gnu.org/copyleft/. 

Each version of the License is given a distinguishing version number. If the Document 
specifies that a particular numbered version of this License "or any later version" applies to 
it, you have the option of following the terms and conditions either of that specified version or 
of any later version that has been published (not as a draft) by the Free Software Foundation. 
If the Document does not specify a version number of this License, you may choose any 
version ever published (not as a draft) by the Free Software Foundation. 


ADDENDUM: How to use this License for your documents 


To use this License in a document you have written, include a copy of the License in the 
document and put the following copyright and license notices just after the title page: 


Copyright OYEAR YOUR NAME. Permission is granted to copy, distribute and/or 
modify this document under the terms of the GNU Free Documentation License, Ver- 
sion 1.2 or any later version published by the Free Software Foundation; with no 
Invariant Sections, no Front-Cover Texts, and no Back-Cover Texts. A copy of the 
license is included in the section entitled "GNU Free Documentation License". 
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If you have Invariant Sections, Front-Cover Texts and Back-Cover Texts, replace the 
"with...Texts." line with this: 


with the Invariant Sections being LIST THEIR TITLES, with the Front-Cover Texts 
being LIST, and with the Back-Cover Texts being LIST. 


If you have Invariant Sections without Cover Texts, or some other combination of the 
three, merge those two alternatives to suit the situation. 

If your document contains nontrivial examples of program code, we recommend releas- 
ing these examples in parallel under your choice of free software license, such as the GNU 
General Public License, to permit their use in free software. 


History 


This section contains the revision history of the book. For persons making modifications to 
the book, please record the pertinent information here, following the format in the first item 
below. 


1. VERSION: 1.0 
Date: 2008-01-04 
Author(s): Michael Corral 
Title: Vector Calculus 
Modification(s): Initial version 
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